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GOODPASTURE ANTIGEN BINDING PROTEIN 

Cross Reference 

This application claims priority to U.S. Provisional Patent Application Serial No. 
60/121,483, filed February 24, 1999. 

Statement of Government Rights 

This work was supported in part by Grants SAL91/0513, SAF94/1051 and 
S AF97/0065 from the Plan Nacional I+D of the Comision Interministerial de Ciencia 
Tecnologia (CICYT, Spain), Grant 93/0343 from Fondo de Investigaciones Sanitarias 
(FISss, Spain) and Grants GV-3 166/95, GV-C-VS-21-1 18-96 from la Direcci6 General 
d'Ensenyaments Universitaris i Investigacio (Comunitat Valenciana, Spain); therefore 
the State of Spain may have rights in the invention. 

Field of the Invention 

The invention relates to the fields of protein kinases, automimmune disease, 
apoptosis, and cancer. 

Background of the Invention 

Goodpasture (GP) disease is an autoimmune disorder described only in humans. In 
GP patients, autoantibodies against the non-collagenous C-terminal domain (NCI) of the 
type IV collagen <x3 chain ("Goodpasture antigen") cause a rapidly progressive 
glomerulonephritis and often lung hemorrhage, the two cardinal clinical manifestations of 
the GP syndrome (see 1 for review. The reference numbers in this section correspond to 
reference list of Example 1). 

The idea that common pathogenic events exist at least for some autoimmune 
disorders is suggested by the significant number of patients displaying more than one 
autoimmune disease, and also by the strong and common linkage that some of these 
diseases show to specific MHC haplotypes (31, 32). The experimental observation that the 
autoantigen is the leading moiety in autoimmunity and that a limited number of self- 
components are autoantigenic (31), suggest that these self-components share biological 
features with important consequences in self/non-self recognition by the immune system. 
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One possibility is that triggering events, by altering different but specific self-components, 
would result in abnormal antigen processing. In certain individuals expressing a particular 
MHC specificity, the abnormal peptides could be recognized by non-tolerized T cells and 
trigger an immune response (1). 

5 We have previously explored the GP antigen to identify biological features of 

relevance in autoimmune pathogenesis. Since the NCI domain is a highly conserved 
domain among species and between the different type IV collagen a chains (ccl-<x6) (2), 
the exclusive involvement of the human a3(IV)NCl in a natural autoimmune response 
suggests that this domain has structural and/or biological peculiarities of pathogenic 

10 relevance. Consistent with this, the N-terminus of the human antigen is highly divergent, 
and it contains a unique five-residue motif (KRGDS 0 ) that conforms to a functional 
phosphorylation site for type A protein kinases (3, 4). Furthermore, the human a3 gene, 
but not the other related human or homologous genes from other species, is alternatively 
spliced and generates multiple transcripts also containing the phosphorylatable N-terminal 

1 5 region (5-7). Recent studies indicate that the phosphorylation of the N-terminus of the GP 
antigen by cAMP-dependent protein kinase is up regulated by the presence of the 
alternative products (see Example 3 below). Specific serine phosphorylation and pre- 
mRNA alternative splicing are also associated with the biology of other autoantigens 
including the acetylcholine receptor and myelin basic protein (MBP) (4). The latter is 

20 suspected to be the major antigen in multiple sclerosis (MS), another exclusively human 
autoimmune disease in which the immune system targets the white matter of the central 
nervous system. GP disease and MS are human disorders that display a strong association 
with the same HLA class II haplotype (HLA DRB1* 1501)(32, 33). This, along with the 
recent report of death by GP disease of an MS patient carrying this HLA specificity (34), 

25 supports the existence of common pathogenic events in these human disorders. 

Thus, specific serine/threonine phosphorylation may be a major biological 
difference between the human GP antigen, the GP antigens of other species, and the 
homologous domains from the other human a(IV) chains, and might be important in 
pathogenesis (1, 4). 

30 Therefore, the identification and isolation of the specific serine/threonine kinase 

that phosphorylates the N-terminal region of the human GP antigen would be very 
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advantageous for the diagnosis and treatment of GP syndrome, and possibly for other 
autoimmune disorders. 

5 Summary of the Invention 

The present invention fulfills the need in the art for the identification and isolation 
of a serine/threonine kinase that specifically binds to and phosphorylates the unique N- 
terminal region of the human GP antigen. In one aspect, the present invention provides 
nucleic acid sequences encoding various forms of the Goodpasture antigen binding protein 
10 (GPBP), as well as recombinant expression vectors operatively linked to the GPBP- 
encoding sequences. 

In another aspect, the present invention provides host cells that have been 
transfected with the recombinant expression vectors. In a further aspect, the present 
invention provides substantially purified GPBP and antibodies that selectively bind to 
15 GPBP. In still further aspect, the invention provides methods for detecting the presence of 
GPBP or nucleic acids encoding GPBP. 

In a further aspect, the present invention provides methods for detecting the 
presence of an autoimmune condition or apoptosis, which comprises detecting an 
increase in the expression of GPBP in a tissue compared to a control tissue. 
20 In another aspect, the present invention provides methods and pharmaceutical 

compositions for treating an autoimmune disorder, apoptosis, or a tumor, comprising 
. modifying the expression or activity of GPBP in a patient in need thereof. 

Brief Description of the Figures 

25 Figure 1. Nucleotide and derived amino acid sequences of n4\ The denoted 

structural features are from 5' to 3' end: the cDNA present in the original clone (HeLal) 
(dotted box), which contains the PH homology domain (in black) and the Ser-Xaa-Yaa 
repeat (in gray); the heptad repeat of the predictable coiled-coil structure (open box) 
containing the bipartite nuclear localization signal (in gray); and a serine-rich domain 

30 (filled gray box). The asterisks denote the positions of in frame stop codons. 

Figure 2. Distribution of GPBP in human tissues (Northern blot) and in 
eukaryotic species (Southern blot). A random primed 32 P-labeled HeLal cDNA probe 
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was used to identify homologous messages in a Northern blot of poly(A*)RNA from 
the indicated human tissues (panel A) or in a Southern blot of genomic DNA from the 
indicated eukaryotic species (panel B). Northern hybridization was performed under 
highly stringent conditions to detect perfect matching messages and at low stringency 
5 in the Southern to allow the detection of messages with mismatches. No appreciable 
differences in the quality and amount of each individual poly A+ RNA was observed 
by denaturing gel electrophoresis or when probing a representative blot from the same 
lot with human p-actin cDNA. The numbers denote the position and the sizes in kb of 
the RNA or DNA markers used. 

10 Figure 3. Experimental determination of the translation start site. In (A), the 

two cDNAs present in pc-n4' and pc-FLAG-n4' plasmids used for transient expression 
are represented as black lines. The relative position of the corresponding predicted 
(n4') or engineered (FLAG-n4') translation start site is indicated (Met). In (B), the 
extracts from control (-), pc-n4\n4') or pc-FLAG-n4' (FLAG-n4*) transfected 293 

15 cells were subjected to SDS-PAGE under reducing conditions in 10% gels. The 
separated proteins were transferred to a PVDF membrane (Millipore) and blotted with 
the indicated antibodies. The numbers and bars indicate the molecular mass in kDa and 
the relative positions of the molecular weight markers, respectively. 

Figure 4. Characterization of rGPBP from yeast and 293 cells. In (A), 1 \ig 

20 (lane 1) or 100 ng (lanes 2 and 3) of yeast rGPBP were analyzed by reducing SDS- 
PAGE in a 10% gel. The separated proteins were stained with Coomassie blue (lane 1) 
or transferred and blotted with anti-FLAG antibodies (lane 2) or Mabl4, a monoclonal 
antibody against GPBP (lane 3). In (B), the cell extracts from GPBP-expressing yeast 
were analyzed as in A and blotted with anti-FLAG (lane 1), anti-PSer (lane 2), anti- 

25 PThr (lane 3) or anti-PTyr (lane 4) monoclonal antibodies respectively. In (C), 200 ng 
of either yeast rGPBP (lane 1), dephosphorylated yeast rGPBP (lane 2) or 293 cells- 
derived rGPBP (lane 3) were analyzed as in B with the indicated antibodies. In (D), 
similar amounts of H3 32 P04-labeled non-transfected (lanes 1), stable pc-n4' transfected 
(lanes 2) or transient pc-FLAG-n4' expressing (lanes 3) 293 cells were lysed, 

30 precipitated with the indicated antibodies and analyzed by SDS-PAGE and 
autoradiography. The molecular weight markers are represented with numbers and bars 
as in Figure 3. The arrows indicate the position of the rGPBP. 
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Figure 5. Recombinant GPBP contains a serine/threonine kinase that 
specifically phosphorylates the N-terminal region of the human GP antigen. To 

assess phosphorylation, approximately 200 ng of yeast rGPBP was incubated with 
[y] 32 P-ATP in the absence (A and B) or presence of GP antigen-derived material (C). In 

5 (A), the mixture was subjected to reducing SDS-PAGE (10% gel) and 
autoradiographed. In (B), the mixture was subjected to 32 P-phosphoamino acid analysis 
by two-dimensional thin-layer chromatography. The dotted circles indicate the position 
of ninhydrin stained phosphoamino acids. In (C), the phosphorylation mixtures of the 
indicated GP -derived material were analyzed by SDS-PAGE (15% gel) and 

10 autoradiography (GPpepl and GPpeplAla 9 ) or immunoprecipitated with Mab 17, a 
monoclonal antibody that specifically recognize GP antigen from human and bovine 
origin, and analyzed by SDS-PAGE (12.5%) and autoradiography (rGP, GP). The 
relative positions of rGPBP (A), rGP antigen and the native human and bovine GP 
antigens (C) are indicated by arrows. The numbers and bars refer to molecular weight 

15 markers as in previous Figures. 

Figure 6. In-blot renaturation of the serine/threonine kinase present in 
rGPBP. Five micrograms of rGPBP from yeast were in-blot renatured. The 
recombinant material was specifically identified by anti-FLAG antibodies (lane 1) and 
the in situ 32 P-incorporation detected by autoradiography (lane 2). The numbers and 

20 bars refer to molecular weight markers as in previous Figures. The arrow indicates the 
position of the 89 kDa rGPBP polypeptide. 

Figure 7. Immunological localization of GPBP in human tissues. Rabbit 
serum against the N-tenninal region of GPBP (1:50) was used to localize GPBP in 
human tissues. The tissues shown are kidney (A) glomerulus (B), lung (C), alveolus 

25 (D), liver (E), brain (F), testis (G), adrenal gland (H), pancreas (I) and prostate (J). 
Similar results were obtained using anti-GPBP affinity-purified antibodies or a pool of 
culture medium from seven different GPBP-specific monoclonal antibodies (anti-GPBP 
Mabs 3, 4, 5, 6, 8, 10 and 14). Rabbit pre-immune serum did not stain any tissue 
structure in parallel control studies. Magnification was 40X except in B and D where it 

30 waslOOX. 

Figure 8. GPBPA26 is a splicing variant of GPBP. (A) Total RNA from 
normal skeletal muscle was retrotranscribed using primer 53c and subsequently 
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subjected to PCR with primers llm-53c (lane 2) or 15m-62c {lane 4). Control 
amplifications of a plasmid containing GPBP cDNA using the same pairs of primers 
are shown in lanes 1 and J. Numbers on the left and right refer to molecular weight in 
base pairs. The region missing in the normal muscle transcript was identified and its 

5 nucleotide sequence {lower case) and deduced amino acid sequence {upper case) are 
shown in (B). A clone of genomic DNA comprising the cDNA region of interest was 
sequenced and its structure is drawn in (C), showing the location and relative sizes of 
the 78-bp exon spliced out in GPBPA26 {black box), adjacent exons {gray boxes), and 
introns {lines). The size of both intron and exons is given and the nucleotide sequence 

10 of intron-exon boundaries is presented, with consensus for 5' and 3' splice sites shown 
in bold case. 

Figure 9. Differential expression of GPBP and GPBPA26. Fragments 
representing the 78-bp exon (GPBP) or flanking sequences common to both isoforms 
(GPBP/GPBPA26) were 32 P-labeled and used to hybridize human tissue and tumor cell 

15 line Northern blots (CLONTECH). The membranes were first hybridized with GPBP- 
specific probe, stripped and then reanalyzed with GPBP/GPBPA26 probe. Washing 
conditions were less stringent for GPBP-specific probe (0.1% SSPE, 37°C or 55°C) 
than for the GPBP/GPBPA26 (0.1% SSPE, 68°C) to increase GPBP and GPBPA26 
signals respectively. No detectable signal was obtained for the GPBP probe when the 

20 washing program was at 68°C (not shown). 

Figure 10. GPBPA26 displays lower phosphorylating activity than GPBP. 

(A) Recombinantly-expressed, affinity-purified GPBP (rGPBP) (lanes 1) or rGPBPA26 
(lanes 2) were subjected to SDS-PAGE under reducing conditions and either Coomasie 
blue stained (2 jig per lane) or blotted (200ng per lane) with monoclonal antibodies 

25 recognizing the FLAG sequence (ct-FLAG) or GPBP/GPBPA26 (Mabl4). (B) 200 ng 
of rGPBP (lanes 1) or rGPBPA26 (lanes 2) were in vitro phosphorylated without 
substrate to assay auto-phosphorylation (left), or with 5 nmol GPpepl to measure trans- 
phosphorylation activity (right). An arrowhead indicates the position of the peptide. (C) 
3 jig of rGPBP (lane 1) or rGPBPA26 (lane 2) were in-blot renatured as described 

30 under Material and Methods. The numbers and bars indicate the molecular mass in kDa 
and the relative position of the molecular weight markers, respectively. 
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Figure 11. rGPBP and rGPBPA26 form very active high molecular weight 
aggregates. About 300 ng of rGPBP (A) or rGPBPA26 (B) were subjected to gel 
filtration HPLC as described under Material and Methods. Vertical arrowheads and 
numbers respectively indicate the elution profile and molecular mass (kDa) of the 
5 molecular weight standards used. Larger aggregates eluted in the void volume (I), and 
the bulk of the material present in the samples eluted in the fractionation range of the 
column as a second peak between the 669 and 158 kDa markers (II). Fifteen microliters 
of the indicated minute fractions were subjected to SDS-PAGE and Coomasie blue 
staining. Five microliters of the same fractions were in vitro phosphorylated as 
10 described in Materials and Methods, and the reaction stopped by boiling in SDS sample 
buffer. The fractions were loaded onto SDS-PAGE, transferred to PVDF and 
autoradiographed for 1 or 2 hours using Kodak X-Omat films and blotted using anti- 
FLAG monoclonal antibodies (Sigma). 

Figure 12. Self-interaction of GPBP and GPBPA26 assessed by a yeast two- 
15 hybrid system. (A) Cell transfected for the indicated combinations of plasmids were 
selected on leucine-tryptophan-deficient medium (-Trp, -Leu), and independent 
transformants restreaked onto histidine-deficient plates {-Trp, -Leu, -His) in the 
presence or absence of 1 mM 3-amino-triazole (3-AT), to assess interaction. The 
picture was taken 3 days after streaking. (B) The bars represent mean values in p- 
20 galactosidase arbitrary units of four independent p-galactosidase in-solution assays. 

Figure 13. GPBP is expressed associated with endothelial and glomerular 
basement membranes. Paraffin embedded sections of human muscle (A) or renal 
cortex (B, C) were probed with GPBP-specific antibodies (A,B) or with Mabl89, a 
monoclonal antibody specific for the human a3(IV)NCl (C). Frozen sections of human 
25 kidney (D-F) were probed with Mabl7, a monoclonal antibody specific for the 
a3(IV)NCl domain (D), GPBP-specific antibodies (E), or sera from a GP patient (F). 
Control sera (chicken pre-immune and human control) did not display tissue-binding in 
parallel studies (not shown). 

Figure 14. GPBP is expressed in human but not in bovine and murine renal 
30 cortex. Cortex from human (A, D), bovine (B, E) or murine (C, F) kidney were paraffin 



WO 00/50607 



8 



PCT/IBOO/00324 



embedded and probed with either GPBP-specific antibodies (A-C) or GPBP/GPBPA26- 
specific antibodies (D-F). 

Figure 15. GPBP is highly expressed in several autoimmune conditions. 
Skeletal muscle total RNA from a control individual (lane 1) or from a GP patient (lane 
5 2) was subjected to RT-PCR as in Fig.8, using the oligonucleotides 15m and 62c in the 
amplification program. Frozen (B-D) or paraffin embedded (E-G) human control skin 
(B, E) or skin affected by SLE (C, F) or lichen planus (D, G) were probed with GPBP- 
specific antibodies. 

Figure 16. Phosphorylation of GP alternative splicing products by PKA. In 

10 left panel, equimolecular amounts of rGP (lanes 1), rGPAV (lanes 2), rGPAIII (lanes 3) 
or rGPAIII/IV/V (lanes 4), equivalent to 500 ng of the GP were phosphorylated at the 
indicated ATP concentrations. One-fifth of the total phosphorylation reaction mixture 
was separated by gel electrophoresis and transferred to PVDF, autoradiographed 
(shown) and the proteins blotted with M3/1, a specific monoclonal antibody 
15 recognizing all four species (shown) or using antibodies specific for each individual C- 
terminal region (not shown). Arrowheads indicate the position of each recombinant 
protein, from top to bottom, GP, GPAV and, GPAIII -GPAIII/IV/V which displayed the 
same mobilities. Right panel: purified a3(IV)NCl domain or hexamer was 
phosphorylated with PKA and 0.1 yM ATP in the absence (lanes 1) or in the presence 
20 of 10 nmol of peptides representing the C-terminal region of either GPAIII (lanes 2) or 
. GPAIILTV7V (lanes 3). Where indicated the phosphorylation mixtures of purified 
a3(IV)NCl domain were V8 digested and immunoprecipitated with antibodies specific 
for the N terminus of the human a3(IV)NCl domain (3). Bars and numbers indicate the 
position and sizes (kDa) of the molecular weight markers. 
25 Figure 17. Sequence alignment of GPAIII and MBP. The phosphorylation 

sites for PKA (boxed) and the structural similarity for the sites at Ser 8 and 9 of MBP 
and GPAIII respectively are shown (underlined). The identity (vertical bars) and 
chemical homology (dots) of the corresponding exon II (bent arrow) of both molecular 
species are indicated. The complete sequence of GPAIII from the collagenase cleavage 
30 site (72-residues) is aligned with the 69-N terminal residues of MBP comprising the 
exon I and ten residues of the exon II. 
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Figure 18. Phosphorylation of rec mbinant MBP proteins by PKA. About 
200 ng of rMBP (lane 1), or Ser to Ala mutants thereof in position 8 (lane 2) or 57 (lane 
3), or rMPBAII (lane 4) or Ser to Ala mutants thereof in position 8 (lane 5) or 57 (lane 
6), were phosphorylated by PKA and 0.1 }iM ATP. The mixtures were subjected to 
5 SDS-PAGE, transferred to PVDF and autoradiographed (Phosphorylation) and the 
individual molecular species blotted with monoclonal antibodies against human MBP 
obtained from Roche Molecular Biochemicals (Western). 

Figure 19. Phosphorylation of recombinant MBP proteins by GPBP. About 
200 ng of rMBP (lane 1), or Ser to Ala mutants thereof in positions 8 (lane 2) or 57 
10 (lane 3), or rMPBAII (lane 4), or Ser to Ala mutants thereof in positions 8 (lane 5) or 57 
(lane 6), were subjected to SDS-PAGE, transferred to PVDF, and the area containing 
the proteins visualized with Ponceau and stripped out. The immobilized proteins were 
in situ phosphorylated with rGPBP as described in Materials and Methods, 
autoradiographed (Phosphorylation) and subsequently blotted as in Fig. 18 (Western). 

15 

Figure 20. Regulation of the GPBP by the C terminal region of GPAII1. 

About 200 ng of rGPBP were in vitro phosphorylated with 150 |j.M ATP in the absence 
(lane 1) or in the presence of 5 nmol of GPAIII -derived peptide synthesized either using 
Boc- (lane 2) or Fmoc- (lane 3) chemistry. The reaction mixtures were subjected to 
20 SDS-PAGE, transferred to PVDF and autoradiographed to asses autophosphorylation, 
and subsequently blotted with anti-FLAG monoclonal antibodies (Sigma) to determine 
the amount of recombinant material present (Western). 

25 Detailed Description of the Invention 

All references cited are herein incorporated by reference in their entirety. 

The abbreviations used herein are: bp, base pair; DTT, dithiothreitol; DMEM, 
Dulbecco's modified Eagle's medium; EDTA, ethylenediamine tetraacetic acid; EGTA, 
ethylene glycol-bis(P-aminoethyl ether) N,N,N\N'-tetraacetic acid; GP, Goodpasture; 
30 rGPAIII, rGPAIII/IV/V and rGPAV, recombinant material representing the alternative 
forms of the Goodpasture antigen resulting from splicing out exon III, exon HI, IV and 
V or exon V, respectively; GPBP and rGPBP, native and recombinant Goodpasture 
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antigen binding protein; GPBPA26 and rGPBPA26, native and recombinant alternative 
form of the GPBP; GST, glutathione S-transferase; HLA, human lymphocyte antigens; 
HPLC, high performance liquid chromatography; Kb, thousand base pairs; kDa, 
thousand daltons; MBP, rMBP, native and recombinant 21 kDa myelin basic protein; 
5 MBP AH and rMBP All, native and recombinant 18.5 kDa myelin basic protein that 
results from splicing out exon II; MBPAV and MBPAII/V, myelin basic protein 
alternative forms resulting from splicing out exon V and exons II and V, respectively; 
MHC, major histocompatibility complex; NCI, non-collagenous domain; PH, 
pleckstrin homology; PKA, cAMP-dependent protein kinase; PMSF, 
io phenylmethylsulfonyl fluoride; SDS-PAGE, sodium dodecylsulfate polyacrylamide gel 
electrophoresis; TBS, tris buffered saline. 

Within this application, unless otherwise stated, the techniques utilized may be 
found in any of several well-known references such as: Molecular Cloning: A 
Laboratory Manual (Sarabrook, et al., 1989, Cold Spring Harbor Laboratory Press), 
15 Gene Expression Technology (Methods in Enzymology, Vol. 185, edited by D. 
Goeddel, 1991. Academic Press, San Diego, CA), "Guide to Protein Purification" in 
Methods in Enzymology (M.P. Deutshcer, ed., (1990) Academic Press, Inc.); PCR 
Protocols: A Guide to Methods and Applications (Innis, et al. 1990. Academic Press, 
San Diego, CA), Culture of Animal Cells: A Manual of Basic Technique, 2 nd Ed. (R.I. 
20 Freshney. 1987. Liss, Inc. New York, NY), Gene Transfer and Expression Protocols, 
pp. 109-128, ed. EJ. Murray, The Humana Press Inc., Clifton, NJ.), and the Ambion 
1998 Catalog (Ambion, Austin, TX). 

As used herein, the term "GPBP" refers to Goodpasture binding protein, and 
includes both monomers and oligomers thereof. Human (SEQ ID NO:2), mouse (SEQ 
25 ID NO:4), and bovine GPBP sequences (SEQ ID NO:6) are provided herein. 

As used herein, the term "GPBPA26" refers to Goodpasture binding protein 
deleted for the 26 amino acid sequence shown in SEQ ID NO: 14, and includes both 
monomers and oligomers thereof. Human (SEQ ID NO:8), mouse (SEQ ID NO: 10), 
and bovine GPBP sequences (SEQ ID NO: 12) are provided herein. 
30 As used herein the term "GPBPpepl" refers to the 26 amino acid peptide 

shown in SEQ ID NO: 14, and includes both monomers and oligomers thereof. 
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As used herein, the term "GP antigen" refers to the <x3 NCI domain of type IV 
collagen. 

As used herein, "MBP" refers to myelin basic protein. 

In one aspect, the present invention provides isolated nucleic acids that encode 
5 GPBP, GPBPA26, and GPBPpepl, and mutants or fragments thereof. In one 
embodiment, the isolated nucleic acids comprise sequences substantially similar to SEQ 
ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:l 1, 
SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO: 17, SEQ ID NO:19, SEQ ID NO:21, SEQ 
ID NO:23, or SEQ ID NO:25, or fragments thereof. 
10 In another aspect, the present invention provides isolated nucleic acids that encode 

alternative products of the GP antigen or MBP. In one embodiment, the isolated nucleic 
acids comprise sequences that encode peptides substantially similar to SEQ ID NO:43 and 
SEQIDNO:44. 

The phrase "substantially similar " is used herein in reference to the nucleotide 

15 sequence of DNA or RNA, or the amino acid sequence of protein, having one or more 
conservative or non-conservative variations from the disclosed sequences, including but 
not limited to deletions, additions, or substitutions, wherein the resulting nucleic acid 
and/or amino acid sequence is functionally equivalent to the sequences disclosed 
herein. Functionally equivalent sequences will function in substantially the same 

20 manner to produce substantially the same protein disclosed herein. For example, 
functionally equivalent DNAs encode proteins that are the same as those disclosed 
herein or that have one or more conservative amino acid variations, such as substitution 
of a non-polar residue for another non-polar residue or a charged residue for a similarly 
charged residue. These changes include those recognized by those of skill in the art as 

25 substitutions that do not substantially alter the tertiary structure of the protein. 

In practice, the term substantially similar means that DNA encoding two 
proteins hybridize to one another under conditions of moderate to high stringency, and 
encode proteins that have either the same sequence of amino acids, or have changes in 
sequence that do not alter their structure or function. As used herein, substantially 

30 similar sequences of nucleotides or amino acids share at least about 70% identity, more 
preferably at least about 80% identity, and most preferably at least about 90% identity. 
It is recognized, however, that proteins (and DNA or mRNA encoding such proteins) 
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containing less than the above-described level of homology arising as splice variants or 
that are modified by conservative amino acid substitutions (or substitution of 
degenerate codons) are contemplated to be within the scope of the present invention. 

Stringency of hybridization is used herein to refer to conditions under which 

5 nucleic acid hybrids are stable. As known to those of skill in the art, the stability of 
hybrids is reflected in the melting temperature (T M ) of the hybrids. T M decreases 
approximately M.5°C with every 1% decrease in sequence homology. In general, the 
stability of a hybrid is a function of sodium ion concentration and temperature. 
Typically, the hybridization reaction is performed under conditions of lower stringency, 

10 followed by washes of varying, but higher, stringency. Reference to hybridization 
stringency relates to such washing conditions. Thus, as used herein, moderate 
stringency refers to conditions that permit hybridization of those nucleic acid sequences 
that form stable hybrids in 0.1% SSPE at 37°C or 55°C, while high stringency refers to 
conditions that permit hybridization of those nucleic acid sequences that form stable 

1 5 hybrids in 0. 1%SSPE at 65°C. It is understood that these conditions may be duplicated 
using a variety of buffers and temperatures and that they are not necessarily precise. 
Denhardt's solution and SSPE (see, e.g., Sambrook, Fritsch, and Maniatis, in: 
Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1989) 
are well known to those of skill in the art, as are other suitable hybridization buffers. 

20 The isolated nucleic acid sequence may comprise an RNA, a cDNA, or a 

genomic clone with one or more introns. The isolated sequence may further comprise 
additional sequences useful for promoting expression and/or purification of the encoded 
protein, including but not limited to polyA sequences, modified Kozak sequences, and 
sequences encoding epitope tags, export signals, and secretory signals, nuclear 

25 localization signals, and plasma membrane localization signals. 

In another aspect, the present invention provides recombinant expression vectors 
comprising nucleic acid sequences that express GPBP, GPBPA26, or GPBPpepl, and 
mutants or fragments thereof. In one embodiment, the vectors comprise nucleic acid 
sequences that are substantially similar to the sequences shown in SEQ ID NO:l, SEQ ED 

30 NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO:13, 
SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, or 
SEQ ID NO:25, or fragments thereof. 
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In another aspect, the present invention provides recombinant expression vectors 
comprising nucleic acid sequences that express peptides that are substantially similar to 
the amino acid sequence shown in SEQ ID NO:43, SEQ ID NO:44, or peptide fragments 
thereof. 

5 "Recombinant expression vector" includes vectors that operatively link a 

nucleic acid coding region or gene to any promoter capable of effecting expression of 
the gene product. The promoter sequence used to drive expression of the disclosed 
nucleic acid sequences in a mammalian system may be constitutive (driven by any of a 
variety of promoters, including but not limited to, CMV, SV40, RSV, actin, EF) or 
10 inducible (driven by any of a number of inducible promoters including, but not limited 
to, tetracycline, ecdysone, steroid-responsive). The construction of expression vectors 
for use in transfecting prokaryotic cells is also well known in the art, and thus can be 
accomplished via standard techniques. (See, for example, Sambrook, Fritsch, and 
Maniatis, in: Molecular Cloning, A Laboratory Manual, Cold Spring Harbor 
15 Laboratory Press, 1989; Gene Transfer and Expression Protocols, pp. 109-128, ed, EJ. 
Murray, The Humana Press Inc., Clifton, NJ.), and the Ambion 1998 Catalog 
(Ambion, Austin, TX) 

The expression vector must be replicable in the host organisms either as an 
episome or by integration into host chromosomal DNA. In a preferred embodiment, 
20 the expression vector comprises a plasmid. However, the invention is intended to 
include other expression vectors that serve equivalent functions, such as viral vectors. 

In a further aspect, the present invention provides host cells that have been 
transfected with the recombinant expression vectors disclosed herein, wherein the host 
cells can be either prokaryotic or eukaryotic. The cells can be transiently or stably 
25 transfected. Such transfection of expression vectors into prokaryotic and eukaryotic 
cells can be accomplished via any technique known in the art, including but not limited 
to standard bacterial transformations, calcium phosphate co-precipitation, 
electroporation, or liposome mediated-, DEAE dextran mediated-, polycationic 
mediated-, or viral mediated transfection. (See, for example, Molecular Cloning: A 
30 Laboratory Manual (Sambrook, et al, 1989, Cold Spring Harbor Laboratory Press; 
Culture of Animal Cells: A Manual of Basic Technique, 2 nd Ed. (ILL Freshney. 1987. 
Liss, Inc. New York, NY), 
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In a still further aspect, the present invention provides substantially purified 
GPBP, GPBPA26, and GPBPpept, and mutants or fragments thereof. In one 
embodiment, the amino acid sequence of the substantially purified protein is substantially 
similar to SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, 

5 SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ED NO:18, SEQ ID NO:20, SEQ 
ID NO:22, SEQ ID NO:24, or peptide fragments thereof. 

In another aspect, the present invention provides substantially purified alternative 
products of the GP antigen and MBP. In one embodiment, the amino acid sequence of the 
substantially purified polypeptide is substantially similar to SEQ ID NO:43, SEQ ID 

1 0 NO:44, or peptide fragments thereof. 

As used herein, the term "substantially purified" means that the protein has been 
separated from its in vivo cellular environments. Thus, the protein can either be 
purified from natural sources, or recombinant protein can be purified from the 
transfected host cells disclosed above. In a preferred embodiment, the proteins are 

15 produced by the transfected cells disclosed above, and purified using standard techniques. 
(See for example, Molecular Cloning: A Laboratory Manual (Sambrook, et al., 1989, 
Cold Spring Harbor Laboratory Press.)) The protein can thus be purified from 
prokaryotic or eukaryotic sources. In various further preferred embodiments, the 
protein is purified from bacterial, yeast, or mammalian cells. 

20 The protein may comprise additional sequences useful for promoting 

purification of the protein, such as epitope tags and transport signals. Examples of such 
epitope tags include, but are not limited to FLAG (Sigma Chemical, St. Louis, MO), 
myc (9E10) (Invitrogen, Carlsbad, CA), 6-His (Invitrogen; Novagen, Madison, WI), 
and HA (Boehringer Manheim Biochemicals). Examples of such transport signals 

25 include, but are not limited to, export signals, secretory signals, nuclear localization 
signals, and plasma membrane localization signals. 

In another aspect, the present invention provides antibodies that selectively bind to 
GPBP, GPBPA26, or GPBPpepl. In one aspect, the antibodies selectively bind to a 
protein comprising a sequence selected from the group consisting of SEQ ID NO:2, SEQ 

30 ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, 
SEQ ID NO:l6, SEQ ID NO:l8, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, or 
peptide fragments thereof. Such antibodies can be produced by immunization of a host 



WO 00/50607 15 PCT/IBOO/00324 

animal with either the complete GPBP, or with antigenic peptides thereof. The antibodies 
can be either polyclonal or monoclonal. 

In another aspect, the present invention provides antibodies that selectively bind to 
a polypeptide comprising an amino acid sequence substantially similar to a sequence 

5 selected from the group consisting of SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO: 46, 
SEQ ID NO: 48, SEQ ID NO:50, SEQ ID NO:54, or antigenic fragments thereof The 
antibodies can be either polyclonal or monoclonal. 

Antibodies can be made by well-known methods, such as described in Harlow 
and Lane, Antibodies; A Laboratory Manual, Cold Spring Harbor Laboratory, Cold 

10 Spring Harbor, N.Y., (1988). In one example, preimmune serum is collected prior to 
the first immunization. Substantially purified proteins of the invention, or antigenic 
fragments thereof, together with an appropriate adjuvant, is injected into an animal in 
an amount and at intervals sufficient to elicit an immune response. Animals are bled at 
regular intervals, preferably weekly, to determine antibody titer. The animals may or 

15 may not receive booster injections following the initial immunization. At about 7 days 
after each booster immunization, or about weekly after a single immunization, the 
animals are bled, the serum collected, and aliquots are stored at about -20° C. 
Polyclonal antibodies against the proteins and peptides of the invention can then be 
purified directly by passing serum collected from the animal through a column to which 

20 non-antigen-related proteins prepared from the same expression system without GPBP- 
related proteins bound. 

Monoclonal antibodies can be produced by obtaining spleen cells from the 
animal, (See Kohler and Milstein, Nature 256, 495-497 (1975)). In one example, 
monoclonal antibodies (mAb) of interest are prepared by immunizing inbred mice with 

25 the proteins or peptides of the invention, or an antigenic fragment thereof. The mice 
are immunized by the IP or SC route in an amount and at intervals sufficient to elicit an 
immune response. The mice receive an initial immunization on day 0 and are rested for 
about 3 to about 30 weeks. Immunized mice are given one or more booster 
immunizations of by the intravenous (IV) route. Lymphocytes, from antibody positive 

30 mice are obtained by removing spleens from immunized mice by standard procedures 
known in the art. Hybridoma cells are produced by mixing the splenic lymphocytes 
with an appropriate fusion partner under conditions which will allow the formation of 
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stable hybridomas. The antibody producing cells and fusion partner cells are fused in 
polyethylene glycol at concentrations from about 30% to about 50%. Fused hybridoma 
cells are selected by growth in hypoxanthine, thymidine and aminopterin supplemented 
Dulbecco's Modified Eagles Medium (DMEM) by procedures known in the art. 

5 Supernatant fluids are collected from growth positive wells and are screened for 
antibody production by an immunoassay such as solid phase immunoradioassay. 
Hybridoma cells from antibody positive wells are cloned by a technique such as the soft 
agar technique of MacPherson, Soft Agar Techniques, in Tissue Culture Methods and 
Applications, Kruse and Paterson, Eds., Academic Press, 1973. 

10 To generate such an antibody response, the proteins of the present invention are 

typically formulated with a pharmaceutically acceptable carrier for parenteral 
administration. Such acceptable adjuvants include, but are not limited to, Freund's 
complete, Freund's incomplete, alum-precipitate, water in oil emulsion containing 
Corynebacterium parvum and tRNA. The formulation of such compositions, including 

15 the concentration of the polypeptide and the selection of the vehicle and other 
components, is within the skill of the art. 

The term antibody as used herein is intended to include antibody fragments 
thereof which are selectively reactive with the proteins and peptides of the invention, or 
fragments thereof. Antibodies can be fragmented using conventional techniques, and 

20 the fragments screened for utility in the same manner as described above for whole 
antibodies. For example, F(ab')2 fragments can be generated by treating antibody with 
pepsin. The resulting F(ab f )2 fragment can be treated to reduce disulfide bridges to 
produce Fab' fragments. 

In a further . aspect, the invention provides methods for detecting the presence of 

25 the proteins or peptides of the invention in a protein sample, comprising providing a 
protein sample to be screened, contacting the protein sample to be screened with an 
antibody against the proteins or peptides of the invention, and detecting the formation 
of antibody-antigen complexes. The antibody can be either polyclonal or monoclonal 
as described above, although monoclonal antibodies are preferred. As used herein, the 

30 term "protein sample" refers to any sample that may contain the proteins or peptides of 
the invention, and fragments thereof, including but not limited to tissues and portions 
thereof, tissue sections, intact cells, cell extracts, purified or partially purified protein 
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samples, bodily fluids, nucleic acid expression libraries. Accordingly, this aspect of 
the present invention may be used to test for the presence of GPBP, GPBPA26, 
GPBPpepl, or alternative products of the GP antigen in these various protein samples 
by standard techniques including, but not limited to, immunolocalization, 
immunofluorescence analysis, Western blot analysis, ELISAs, and nucleic acid 
expression library screening, (See for example, Sambrook et al, 1989.) In one 
embodiment, the techniques may determine only the presence or absence of the protein 
or peptide of interest. Alternatively, the techniques may be quantitative, and provide 
information about the relative amount of the protein or peptide of interest in the sample. 
For quantitative purposes, ELISAs are preferred. 

Detection of immunocomplex formation between the proteins or peptides of the 
invention, or fragments thereof, and their antibodies or fragments thereof, can be 
accomplished by standard detection techniques. For example, detection of 
immunocomplexes can be accomplished by using labeled antibodies or secondary 
antibodies. Such methods, including the choice of label are known to those ordinarily 
skilled in the art. (Harlow and Lane, Supra). Alternatively, the polyclonal or 
monoclonal antibodies can be coupled to a detectable substance. The term "coupled" is 
used to mean that the detectable substance is physically linked to the antibody. 
Suitable detectable substances include various enzymes, prosthetic groups, fluorescent 
materials, luminescent materials and radioactive materials. Examples of suitable 
enzymes include horseradish peroxidase, alkaline phosphatase, p-galactosidase, or 
acetylcholinesterase. Examples of suitable prosthetic-group complexes include 
streptavidinftiotin and avidin/biotin. Examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin. An example of a 
luminescent material includes luminol. Examples of suitable radioactive material 
include l2 V 3 V 5 S or 3 H. 

Such methods of detection are useful for a variety of purposes, including but not 
limited to detecting an autoimmune condition, identifying cells targeted for or 
undergoing apoptosis, immunolocalization of the proteins of interest in a tissue sample, 
Western blot analysis, and screening of expression libraries to find related proteins. 
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In yet another aspect, the invention provides methods for detecting the presence in 
a sample of nucleic acid sequences encoding the GPBP, GPBPA26, GPBPpepl, or 
alternative products of the GP antigen comprising providing a nucleic acid sample to be 
screened, contacting the sample with a nucleic acid probe derived from the isolated 
5 nucleic acid sequences of the invention, or fragments thereof, and detecting complex 
formation. 

As used herein, the term "sample" refers to any sample that may contain GPBP- 
related nucleic acid, including but not limited to tissues and portions thereof, tissue 
sections, intact cells, cell extracts, purified or partially purified nucleic acid samples, 

10 DNA libraries, and bodily fluids. Accordingly, this aspect of the present invention may 
be used to test for the presence of GPBP mRNA or DNA in these various samples by 
standard techniques including, but not limited to, in situ hybridization, Northern 
blotting, Southern blotting, DNA library screening, polymerase chain reaction (PCR) or 
reverse transcription-PCR (RT-PCR). (See for example, Sambrook et al, 1989.) In one 

15 embodiment, the techniques may determine only the presence or absence of the nucleic 
acid of interest. Alternatively, the techniques may be quantitative, and provide 
information about the relative amount of the nucleic acid of interest in the sample. For 
quantitative purposes, quantitative PCR and RT-PCR are preferred. Thus, in one 
example, RNA is isolated from a sample, and contacted with an oligonucleotide derived 

20 from the nucleic acid sequence of interest, together with reverse transcriptase under 
suitable buffer and temperature conditions to produce cDNAs from the GPBP-rclated 
. RNA. The cDNA is then subjected to PCR using primer pairs derived from the nucleic 
acid sequence of interest. In a preferred embodiment, the primers are designed to detect 
the presence of the RNA expression product of SEQ ID NO:5, and the amount of GPBP 

25 gene expression in the sample is compared to the level in a control sample. 

For detecting the nucleic acid sequence of interest, standard labeling techniques 
can be used to label the probe, the nucleic acid of interest, or the complex between the 
probe and the nucleic acid of interest, including, but not limited to radio-, enzyme-, 
chemiluminescent-, or avidin or biotin-labeling techniques, all of which are well known 

30 in the art. (See, for example, Molecular Cloning: A Laboratory Manual (Sambrook, et 
al., 1989, Cold Spring Harbor Laboratory Press), Gene Expression Technology 
(Methods in Enzymology, Vol. 185, edited by D. Goeddel, 1991. Academic Press, San 
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Diego, CA); PCR Protocols: A Guide to Methods and Applications (Innis, et al. 1990. 
Academic Press, San Diego, CA)). 

Such methods of nucleic acid detection are useful for a variety of purposes, 
including but not limited to diagnosing an autoimmune condition, identifying cells 

5 targeted for or undergoing apoptosis, in situ hybridization, Northern and Southern blot 
analysis, and DNA library screening. 

As demonstrated in the following examples, GPBP shows preferential 
expression in tissue structures that are commonly targeted in naturally-occurring 
automimmune responses, and is highly expressed in several autoimmune conditions, 

10 including but not limited to Goodpasture Syndrome (GP), systemic lupus 
erythematosus (SLE), and lichen planus. Furthermore, following a similar 
experimental approach to that described below, recombinant proteins representing 
autoantigens in GP disease (a3 Type IV collagen), SLE (PI ribosomal phosphoprotein 
and Sm-Dl small nuclear ribonucleoproteins) and dermatomyositis (hystididyl-tRNA 

15 synthetase) were shown to be in vitro substrates of GPBP. 

Thus, in a preferred embodiment, detection of GPBP expression is used to 
detect an autoimmune condition. A sample that is being tested is compared to a control 
sample for the expression of GPBP, wherein an increased level of GPBP expression 
indicates the presence of an autoimmune condition. In this embodiment, it is preferable 

20 to use antibodies that selectively bind to GPBPpepl , which is present in GPBP but not 
in GPBPA26. 

Furthermore, as shown in the accompanying examples, GPBP is down- 
regulated in tumor cell lines, and the data suggest that GPBP/GPBPA26 are likely to be 
involved in cell signaling pathways that induce apoptosis, which may be up-regulated 

25 during autoimmune pathogenesis and down-regulated during cell transformation to 
prevent autoimmune attack to transformed cells during tumor growth. Thus, the 
detection methods disclosed herein can be used to detect cells that are targeted for, or 
are undergoing apoptosis. 

In another aspect, the present invention provides a method for treating an 

30 autoimmune disorder, a tumor, or for preventing cell apoptosis comprising modification 
of the expression or activity of GPBP, GPBPA26, or a protein comprising a polypeptide 
substantially similarly to GPBPpepl in a patient in need thereof. Modifying the 
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expression or activity of GPBP, GPBPA26, or a protein comprising a polypeptide 
substantially similarly to GPBPpepl can be accomplished by using specific inducers or 
inhibitors of GPBP expression or activity, GPBP antibodies, gene or protein therapy 
using GP or myelin basic protein alternative products, cell therapy using host cells 

5 expressing GP or myelin basic protein alternative products, antisense therapy, or other 
techniques known in the art. In a preferred embodiment, the method further comprises 
administering a substantially purified alternative product of the GP antigen or MBP to 
modify the expression or activity of GPBP, GPBPA26, or a protein comprising a 
polypeptide substantially similarly to GPBPpepl. As used herein, "modification of 

10 expression or activity" refers to modifying expression or activity of either the RNA or 
protein product. 

In a further aspect, the present invention provides pharmaceutical compositions, 
comprising an amount effective of substantially purified alternative products of the GP 
antigen or MBP to modify the expression or activity of GPBP RNA or protein, and a 

15 pharmaceutical^ acceptable carrier. 

For administration, the active agent is ordinarily combined with one or more 
adjuvants appropriate for the indicated route of administration. The compounds may be 
mixed with lactose, sucrose, starch powder, cellulose esters of alkanoic acids, stearic 
acid, talc, magnesium stearate, magnesium oxide, sodium and calcium salts of 

20 phosphoric and sulphuric acids, acacia, gelatin, sodium alginate, polyvinylpyrrolidine, 
and/or polyvinyl alcohol, and tableted or encapsulated for conventional administration. 
Alternatively, the compounds of this invention may be dissolved in saline, water, 
polyethylene glycol, propylene glycol, carboxymethyl cellulose colloidal solutions, 
ethanol, corn oil, peanut oil, cottonseed oil, sesame oil, tragacanth gum, and/or various 

25 buffers. Other adjuvants and modes of administration are well known in the 
pharmaceutical art. The carrier or diluent may include time delay material, such as 
glyceryl monostearate or glyceryl distearate alone or with a wax, or other materials well 
known in the art. 

The present invention may be better understood with reference to the 
30 accompanying examples that are intended for purposes of illustration only and should 
not be construed to limit the scope of the invention, as defined by the claims appended 
hereto. 
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Example 1 : Characterizati d of GPBP 

Here we report the cloning and characterization of a novel type of serine/threonine 
kinase that specifically binds to and phosphorylates the unique N-terminal region of the 
5 human GP antigen. 

MATERIALS AND METHODS 

Synthetic polymers-Peptides. GPpepl, KGKRGDSGSPATWTTRGFVFT (SEQ 
10 ID NO:26), representing residues 3-23 of the human GP antigen and GPpeplAla 9 , 

KGKRGDAGSPATWTTRGFVFT (SEQ ID NO:27), a mutant Ser 9 to Ala 9 thereof, were 

synthesized by MedProbe and CHIRON. FLAG peptide, was from Sigma. 

Oligonucleotides. The following as well as several other GPBP-specific 

oligonucleotides were synthesized by Genosys and GIBCO BRL: 
15 ON-GPBP-54m: 

TCGAATTCACCATGGCCCCACTAGCCGACTACAAGGACGACGATG 

ACAAG (SEQ ID NO: 28). 

ON-GPBP-55c: 

CCGAGCCCGACGAGTTCCAGCTCTGATTATCCGACATCTTGTCATCG 
20 TCG (SEQ ID NO:29). 

ON-HNC-B-N-14m: CGGGATCCGCTAGCTAAGCCAGGCAAGGATGG (SEQ ID 
NO:30). 

ON-HNC-B-N-I6c: CGGGATCCATGCATAAATAGCAGTTCTGCTGT (SEQ ID 
NO:31). 

25 Isolation and characterization of cDNA clones encoding human GPBP- 

Several human X-gtl 1 cDNA expression libraries (eye, fetal and adult lung, kidney and 
HeLa S3, from CLONTECH) were probed for cDNAs encoding proteins interacting 
with GPpepl. Nitrocellulose filters (Millipore) prepared following standard 
immunoscreening procedures were blocked and incubated with 1-10 nmoles per ml of 
30 GPpepl at 37°C. Specifically bound GPpepl was detected using M3/1A monoclonal 
antibodies (7). A single clone was identified in the HeLa-derived library (HeLal). 
Specificity of fusion protein binding was confirmed by similar binding to recombinant 
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eukaryotic human GP antigen. The EcoRI cDNA insert of HeLal (0.5-kb) was used to 
further screen the same library and to isolate overlapping cDNAs. The largest cDNA 
(2.4-kb) containing the entire cDNA of HeLal (n4*) was fully sequenced. 

Northern and Southern blots-Pre-made Northern and Southern blots 

5 (CLONTECH) were probed with HeLal cDNA following manufacturer instructions. 

Plasmid construction, expression and purification of recombinant proteins- 
GPBP-derived material. The original X-gtll HeLal clone was expressed as a lysogen 
in E. Coli Y1089 (8). The corresponding (3-gaiactosidase-derived fusion protein 
containing the N-tenninal 150 residues of GPBP was purified from the cell lysate using 

10 an APTG-agarose column (Boehringer). The EcoRI 2.4-kb fragment of n4' was 
subcloned in Bluescribe Ml 3+ vector (Stratagene) (BS-n4'), amplified and used for 
subsequent cloning. A DNA fragment containing (from 5' to 3'), an EcoRI restriction 
site, a standard Kozak consensus for translation initiation, a region coding for a tag 
peptide sequence (FLAG, DYKDDDDK (SEQ ID NO:32)), and the sequence coding 

15 for the first eleven residues of GPBP including the predicted Meti and a Ban II 
restriction site, was obtained by hybridizing ON-GPBP-54m and ON-GPBP-55c, and 
extending with modified T7 DNA polymerase (Amersham). The resulting DNA product 
was digested with EcoRI and Banll, and ligated with the Banll/EcoRI cDNA fragment 
of BS-n4' in the EcoRI site of pHIL-D2 (Invitrogen) to produce pHIL-FLAG-n4\ This 

20 plasmid was used to obtain Mut s transformants of the GS1 15 strain of Pickia pastoris 
and to express FLAG-tagged recombinant GPBP (rGPBP) either by conventional liquid 
culture or by fermentation procedures (Pichia Expression Kit, Invitrogen). The cell 
lysates were loaded onto an anti-FLAG M2 column (Sigma), the unbound material 
washed out with Tris buffered saline (TBS, 50 mM Tris-HCl, pH 7.4, 150 mM NaCl) 

25 or salt-supplemented TBS (up to 2M NaCl), and the recombinant material eluted with 
FLAG peptide. For expression in cultured human kidney-derived 293 cells (ATCC 
1573-CRL), the 2.4- or 2.0-kb EcoRI cDNA insert of either BS-n4' or pHIL-FLAG-n4' 
was subcloned in pcDNA3 (Invitrogen) to produce pc-n4' and pc-FLAG-n4 ) 
respectively. When used for transient expression, 18 hours after transfection the cells 

30 were lysed with 3.5-4 |il/cm 2 of chilled lysis buffer (1% Nonidet P-40 or Triton-XlOO, 
5mM EDTA and 1 mM PMSF in TBS) with or without 0.1% SDS, depending on 
whether the lysate was to be used for SDS-PAGE or FLAG-purification, respectively. 
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For FLAG purification, the lysate of four to six 175 cm culture dishes was diluted up 
to 50 ml with lysis buffer and purified as above. For stable expression, the cells were 
similarly transfected with pc-n4' and selected for three weeks with 800 jug/ml of G418. 
For bacterial recombinant expression, the 2.0-kb EcoRI cDNA fragment of pHIL- 

5 FLAG-n4' was cloned in-frame downstream of the glutathione ^-transferase (GST> 
encoding cDNA of pGEX-5x-l (Pharmacia). The resulting construct was used to 
express GST-GPBP fusion protein in DH5ot cells (9). 

GP antigen-derived material. Human recombinant GP antigen (rGP) was 
produced in 293 cells using the pRc/CMV-BM40 expression vector containing the a3- 

10 specific cDNA between ON-HNC-B-N-14m and ON-HNC-B-N-16c. The expression 
vector is a pRc/CMV (Invitrogen)-derived vector provided by Billy G. Hudson (Kansas 
University Medical Center) that contains cDNA encoding an initiation Met, a BM40 
signal peptide followed by a tag peptide sequence (FLAG), and a polylinker cloning 
site. To obtain a3-specific cDNA, a polymerase chain reaction was performed using 

15 the oligonucleotides above and a plasmid containing the previously reported a3(IV) 
cDNA sequence (3) as template (clone C2). For stable expression of rGP, 293 cells 
were transfected with the resulting construct (fa3VLC) and selected with 400 |ig/ml of 
G418. The harvested rGP was purified using an anti-FLAG M2 column. 

All the constructs were verified by restriction mapping and nucleotide 

20 sequencing. 

Cell culture and DNA transfection-Human 293 cells were grown in 
' Dulbecco's modified Eagle's medium (DMEM) supplemented with 10% fetal calf 
serum. Transfections were performed using the calcium phosphate precipitation method 
of the Profection Mammalian Transfection Systems (Promega). Stably transfected cells 
25 were selected by their resistance to G418. Foci of surviving cells were isolated, cloned 
and amplified. 

Antibody productioa-Polyclonal antibodies against the N-terminal region of 
GPBP. Cells expressing HeLal A.-gtl 1 as a lysogen were lysed by sonication in the 
presence of Laemmli sample buffer and subjected to electrophoresis in a 7.5% 
30 acrylamide preparative gel The gel was stained with Coomassie blue and the band 
containing the fusion protein of interest excised and used for rabbit immunization (10). 
The anti-serum was tested for reactivity using APTG-affmity purified antigen. To 
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obtain affinity-purified antibodies, the anti-serum was diluted 1:5 with TBS and loaded 

onto a Sepharose 4B column containing covalently bound affinity purified antigen. The 

bound material was eiuted and, unless otherwise indicated, used in the 

immunochemical studies. 
5 Monoclonal antibodies against GPBP. Monoclonal antibodies were produced 

essentially as previously reported (7) using GST-GPBP. The supernatants of individual 

clones were analyzed for antibodies against rGPBP. 

In vitro phosphorylation assays-About 200 ng of rGPBP were incubated 

overnight at 30°C in 25 mM P-glycerolphosphate (pH 7.0), 0.5 mM EDTA, 0.5 mM 
10 EGTA, 8 mM MgCl 2 , 5 mM MnCl 2 , 1 mM DTT and 0.132 pM y- 32 P-ATP, in the 

presence or absence of 0.5-1 jig of protein substrates or 10 nmoles of synthetic 

peptides, in a total volume of 50 

In vivo phosphorylation assays-Individual wells of a 24-well dish were seeded 

with normal or with stably pc-n4' transfected 293 cells. When the cells were grown to 
15 the desired density, a number of wells of the normal 293 cells were transfected with pc- 

FLAG-n4'. After 12 hours, the culture medium was removed, 20 f^Ci/well of H3 32 P04 

in 100 }il of phosphate-free DMEM added, and incubation continued for 4 hours. The 

cells were lysed with 300 nl/well of TBS containing 1% Triton X-100, 2 mM EDTA, 1 

mM PMSF, 50 mM NaF and 0.2 mM vanadate, and extracted with specific antibodies 
20 and Protein A-Sepharose. When anti-GPBP serum was used, the lysate was pre-cleared 

using pre-immune serum and Protein A-Sepharose. 

In vitro dephosphorylation of rGPBP-About 1 \ig of rGPBP was 

dephosphorylated in 100 |il of 10 mM Tris-acetate (pH 7.5), 10 mM magnesium acetate 

and 50 mM potassium acetate with 0.85 U of calf intestine alkaline phosphatase 
25 (Pharmacia) for 30 min at 30°C. 

Renaturation assays-In-blot renaturation assays were performed using 1-5 ng 

of rGPBP as previously described (11). 

Nucleotide sequence analysis- cDNA sequence analyses were performed by 

the dideoxy chain termination method using [a] 35 S-dATP, modified T7 DNA 
30 polymerase ( Amersham) and universal or GPBP-specific primers (8- 1 0). 
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32 P-Phosphoamino acid analysis-Immunopurified rGPBP or HPLC gel- 
filtration fractions thereof containing the material of interest were phosphorylated, 
hydrolyzed and analyzed in one dimensional (4) or two dimensional thin layer 
chromatography (12). When performing two dimensional analysis, the buffer for the 

5 first dimension was formic acid:acetic acid:water (1:3.1:35.9) (pH 1.9) and the buffer 
for the second dimension was acetic acid:pyridine:water (2:0.2:37.8) (pH 3.5). Amino 
acids were revealed with ninhydrin, and 32 P-phosphoamino acids by autoradiography. 

Physical methods and immunochemical techniques-SDS-PAGE and 
Western-blotting were performed as in (4). Immunohistochemistry studies were done 

10 on human multi-tissue control slides (Biomeda, Biogenex) using the ABC peroxidase 
method (13). 

Computer analysis-Homology searches were carried out against the GenBank 
and SwissProt databases with the BLAST 2.0 (14) at the NCBI server, and against the 
TIGR Human Gene Index database for expressed sequence tags, using the Institute for 
15 Genomic Research server. The search for functional patterns and profiles was 
performed against the PROSITE database using the ProfileScan program at the Swiss 
Institute of Bioinformatics (15). Prediction of coiled-coil structures was done at the 
Swiss Institute for Experimental Cancer Research using the program Coils (16) with 
both 21 and 28 residue windows. 

20 

RESULTS 

Molecular cloning of GPBP-To search for proteins specifically interacting with 
25 the divergent N-terminal region of the human GP antigen, a 2 1 -residue peptide (GPpep 1 ; 
SEQ ID NO:26)), encompassing this region and flanking sequences, and specific 
monoclonal antibodies against it were combined to screen several human cDNA 
expression libraries. More than 5 x 10 6 phages were screened to identify a single HeLa- 
derived recombinant encoding a fusion protein specifically interacting with GPpep 1 
30 without disturbing antibody binding. 

Using the cDNA insert of the original clone (HeLal), wc isolated a 2.4-kb cDNA 
(n4') that contains 408-bp of 5' -untranslated sequence, an open reading frame (ORF) of 
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1872-bp encoding 624 residues, and 109-bp of 3' -untranslated sequence (Fig. 1) (SEQ ID 
NO:l-2). Other structural features are of interest. First, the predicted polypeptide 
(hereinafter referred to as GPBP) has a large number of phosphorylatable (17.9%) and 
acidic (16%) residues unequally distributed along the sequence. Serine, which is the most 

5 abundant residue (9.3%), shows preference for two short regions of the protein, where it 
comprises nearly 40% of the amino acids, compared to an average of less than 7% 
throughout the rest of the polypeptide chain. It is also noteworthy that the more N- 
terminal, serine-rich region consists mainly of a Ser-Xaa-Yaa repeat. Acidic residues are 
preferentially located at the N-terminal three-quarters of the polypeptide, with nearly 18% 

10 of the residues being acidic. These residues represent only 9% in the most C-terminal 
quarter of the polypeptide, resulting in a polypeptide chain with two electrically opposite 
domains. At the N-terminus, the polypeptide contains a pleckstrin homology (PH) 
domain, which has been implicated in the recruitment of many signaling proteins to the 
cell membrane where they exert their biological activities (17). Finally, a bipartite nuclear 

15 targeting sequence (18) exists as an integral part of a heptad repeat region that meets all 
the structural requirements to form a coiled-coil (16). 

Protein data bank searches revealed homologies almost exclusively within the 
approximately 100 residues at the N-terminal region harboring the PH domain. The PH 
domain of the oxysterol-binding protein is the most similar, with an overall identity of 

20 33.5% and a similarity of 65.2% with GPBP. In addition, the Caenorhabditis elegans 
cosmid F25H2 (accession number Q93569) contains a hypothetical ORF that displays an 
overall identity of 26.5% and a similarity of 61% throughout the entire protein sequence, 
indicating that similar proteins are present in lower invertebrates. Several human 
expressed sequence tags (accession numbers AA287878, AA287561, AA307431, 

25 AA331618, AA040134, AA158618, AA040087, AA122226, AA158617, AA121104, 
AA412432, AA412433, AA282679 and N27578) possess a high degree of nucleotide 
identity (above 98%) with the corresponding stretches of the GPBP cDNA, suggesting 
that they represent human GPBP. Interestingly, the AA287878 EST shows a gap of 67 
nucleotides within the sequence corresponding to the GPBP S'-untranslated region, 

30 suggesting that the GPBP pre-mRNA is alternatively spliced in human tissues (not 
shown). 
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The distribution and expression of the GPBP gene in human tissues was first 
assessed by Northern blot analysis (Fig. 2, panel A). The gene is expressed as two major 
mRNAs species between 4.4-kb and 7.5-kb in length and other minor species of shorter 
lengths. The structural relationship between these multiple mRNA species is not known 
5 and their relative expression varies between tissues. The highest expression level is seen in 
striated muscle (skeletal and heart), while lung and liver show the lowest expression 
levels. 

Southern blot studies analysis of genomic DNA from different species indicated 
that homologous genes exist throughout phytogeny (Fig. 2, panel B). Consistent with the 

10 human origin of the probe, the hybridization intensities decreased in a progressive fashion 
as the origin of the genomic DNA moves away from humans in evolution. 

Experimental determination of the translation start site-To experimentally 
confirm the predicted ORF, eukaryotic expression vectors containing either the 2.4-kb of 
cDNA of n4\ or only the predicted ORF tagged with a FLAG sequence (Fig. 3 A), were 

15 used for transient expression assays in 293 cells. The corresponding extracts were 
analyzed by immunoblot using GPBP- or FLAG-specific antibodies. The GPBP-specific 
antibodies bind to a similar major polypeptide in both transfected cells, but only the 
polypeptide produced by the engineered construct expressed the FLAG sequence (Fig. 
3B). This located the translation start site of the n4* cDNA at the predicted Met and 

20 confirmed the proposed primary structure. Furthermore, the recombinant polypeptides 
displayed a molecular mass higher than expected (80 versus 71 kDa) suggesting that 
GPBP undergoes post-translational modifications. 

Expression and characterization of yeast rGPBP-Yeast expression and FLAG- 
based affinity-purification were combined to produce rGPBP (Fig. 4A). A major 

25 polypeptide of -89 kDa, along with multiple related products displaying lower M r , were 
obtained. The recombinant material was recognized by both anti-FLAG and GPBP- 
specific antibodies, guaranteeing the fidelity of the expression system. Again, however, 
the M T displayed by the major product was notably higher than predicted and even higher 
than the M r of the 293 cell-derived recombinant material, supporting the idea that GPBP 

30 undergoes important and differential post-translational modifications. Since 
phosphorylatable residues are abundant in the polypeptide chain, we investigated the 
existence of phosphoamino acids in the recombinant materials. By using monoclonal or 
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polyclonal (not shown) antibodies against phosphoserine (Pser), phosphothreonine (PThr) 
and phosphotyrosine (PTyr), we identified the presence of all three phosphoresidues either 
in yeast rGPBP (Fig. 4B) or in 293 cell-derived material (not shown). The specificity of 
the antibodies was further assessed by partially inhibiting their binding by the addition of 
5-10 mM of the corresponding phosphoamino acid (not shown). This suggests that the 
phosphoresidue content varies depending upon the cell expression system, and that the M r 
differences are mainly due to phosphorylation. Dephosphorylated yeast-derived material 
consistently displayed similar M T to the material derived from 293 cells, and 
phosphoamino acid content correlates with SDS-PAGE mobilities (Fig.4C). As an in vivo 
measurement, the phosphorylation of rGPBP in the 293 cells was assessed (Fig. 4D). 
Control cells (lanes 1) and cells expressing rGPBP in a stable (lanes 2) or transient (lanes 
3) mode were cultured in the presence of H 3 32 PC>4. Immunoprecipitated recombinant 
material contained 32 P, indicating that phosphorylation of GPBP occurred in vivo and 
therefore is likely to be a physiological process. 

The rGPBP is a serine/threonine kinase that phosphorylates the N-terminal 
region of the human GP antigen-Although GPBP does not contain the conserved 
structural regions required to define the classic catalytic domain for a protein kinase, the 
recent identification and characterization of novel non-conventional protein kinases (19- 
27) encouraged the investigation of its phosphorylating activity. Addition of [y 32 P]ATP to 
rGPBP (either from yeast or 293 cells (not shown)) in the presence of Mn 2+ and Mg 2+ 
resulted in the incorporation of 32 P as PSer and PThr in the major and related products 
recognized by both anti-FLAG and specific antibodies (Fig. 5 A and B), indicating that the 
affinity-purified material contains a Ser/Thr protein kinase. To further characterize this 
activity, GPpepl, GPpeplAla 9 (a GPpepl mutant with Ser 9 replaced by Ala), native and 
recombinant human GP antigens, and native bovine GP antigen were assayed (Fig. SC). 
Affinity-purified rGPBP phosphorylates all human-derived material to a different extent. 
However, in similar conditions, no appreciable 32 P-incorporation was observed in the 
bovine-derived substrate. The lower 32 P incorporation displayed by GPpeplAla 9 when 
compared with GPpepl, and the lack of phosphorylation of the bovine antigen, indicates 
that the kinase present in rGPBP discriminates between human and bovine antigens, and 
that Ser 9 is a target for the kinase. 
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Although the purification system provides high quality material, the presence of 
contaminants with a protein kinase activity could not be ruled out. The existence of 
contaminants was also suggested by the presence of a FLAG-containing 40 kDa 
polypeptide, which displayed no reactivity with specific antibodies nor incorporation of 

5 32 P in the phosphorylation assays (Fig. 4A and 5A). To precisely identify the polypeptide 
harboring the protein kinase activity, we performed in vitro kinase renaturation assays 
after SDS-PAGE and Western-blotted (Fig. 6). We successfully combined the use of 
specific antibodies (lane 1) and autoradiographic detection of in situ 32 P-incorporation 
(lane 2), and identified the 89 kDa rGPBP material as the primary polypeptide harboring 

10 the Ser/Thr kinase activity. The lack of 32 P-incorporation in the rGPBP-derived products, 
as well as in the 40 kDa contaminant, further supports the specificity of the renaturation 
assays and locates the kinase activity to the 89 kDa polypeptide, Recently, it has been 
shown that traces of protein kinases intimately associated with a polypeptide can be 
released from the blot membrane, bind to, and phosphorylate the polypeptide during the 

15 labeling step (28). To assess this possibility in our system, we performed renaturation 
studies using a small piece of membrane containing the 89 kDa polypeptide, either alone 
or together with membrane pieces representing the different regions of the blot lane. We 
observed similar 32 P-incorporation at the 89 kDa polypeptide regardless of the co- 
incubated pieces (not shown), indicating that if there are co-purified protein kinases in our 

20 sample they are not phosphorylating the 89 kDa polypeptide in the renaturation assays 
unless they co-migrate. Co-migration does not appear to be a concern, however, since 
rGPBP deletion mutants (GPBPA26 and R3; see below) displaying different mobilities 
also have kinase activities and could be similarly in-blot renatured (not shown). 

Immunohistochemical localization of the novel kinase-To investigate GPBP 

25 expression in human tissues we performed immunohistochemical studies using specific 
polyclonal (Fig. 7) or monoclonal antibodies (not shown). Although GPBP is widely 
expressed in human tissues, it shows tissue and cell-specificity. In kidney, the major 
expression is found at the tubule epithelial cells and the glomerular mesangial cells and 
podocytes. At the lung alveolus, the antibodies display a linear pattern suggestive of a 

30 basement membrane localization, along with staining of pneumocytes. Liver shows low 
expression in the parenchyma, but high expression in biliary ducts. Expression in the 
central nervous system is observed in the white matter, but not in the neurons of the brain. 
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In testis, a high expression in the spermatogonium contrasts with the lack of expression in 
Sertoli cells. The adrenal gland shows a higher level of expression in cortical cells versus 
the medullar. In the pancreas, GPBP is preferentially expressed in Langerhans islets 
versus the exocrine moiety. In prostate, GPBP is expressed in the epithelial cells but not 

5 in the stroma (Fig. 7). Other locations with high expression of GPBP are striated muscle, 
epithelial cells of intestinal tract, and Purkinje cells of the cerebellum (not shown). In 
general, in tissues where GPBP is highly expressed the staining pattern is mainly diffuse 
cytosolic. However in certain locations there is, in addition, an important staining 
reinforcement at the nucleus (spermatogonium), at the plasma membrane (pneumocyte, 

10 hepatocytc, prostate epithelial cells, white matter) or at the extracellular matrix (alveolus) 
(Fig. 7). 

DISCUSSION 

Our data show that GPBP is a novel, non-conventional serine/threonine kinase. 

15 We also present evidence that GPBP discriminates between human and bovine GP 
antigens, and targets the phosphorylatable region of human GP antigen in vitro. Several 
lines of evidence indicate that the 89 kDa polypeptide is the only kinase in the affinity 
purified rGPBP. First, we found no differences in auto- or trans-phosphorylation among 
rGPBP samples purified in the presence of 150 mM, 0.5 M, 1 M or 2 M salt (not shown), 

20 suggesting that rGPBP does not cany intimately bound kinases. Second, there is no 
FLAG-containing, yeast-derived kinase in our samples, since material purified using 
GPBP-specific antibodies shows no differences in phosphorylation (not shown). Third, a 
deletion mutant (GPBPA26; see below) displays reduced auto- and trans-phosphorylation 
activities (not shown), demonstrating that the 89 kD polypeptide is the only portion of the 

25 rGPBP with the ability to carry out phosphate transfer. 

Although GPBP is not homologous to other non-conventional kinases, they share 
some structural features including an N-tenninal a-helix coiled-coil (26, 27), serine-rich 
motifs (24), high phosphoamino acids content (27), bipartite nuclear localization signal 
(27), and the absence of a typical nucleotide or ATP binding motif (24, 27). 

30 Immunohistochemistry studies show that GPBP is a cytosolic polypeptide also 

found in the nucleus, associated with the plasma membrane and likely at the extracellular 
matrix associated with the basement membrane, indicating that it contains the structural 
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requirements to reach all these destinations. The nuclear localization signal and the PH 
domain confer to it the potential to reach the nucleus and the cell membrane, respectively 
(17, 29, 30). Although GPBP does not contain the structural requirements to be exported, 
the 5'-end untranslated region of its mRNA includes an upstream ORF of 130 residues 

5 with an in-frame stop codon at the beginning (Fig. 1 ). A mRNA editing process inserting a 
single base pair (U) would generate an operative in-frame start site and an ORF of 754- 
residues containing an export signal immediately downstream of the edited Met (not 
shown). Polyclonal antibodies against a synthetic peptide representing part of this 
hypothetical extra-sequence (PRSARCQARRRRGGRTSS (SEQ ID NO:33)) display a 

10 linear vascular reactivity in human tissues suggestive of an extracellular basement 
membrane localization (data not shown). 

Alternatively, a splicing phenomenon could generate transcripts with additional 
unidentified exon(s) that would provide the structural requirements for exportation. The 
multiple cellular localization, the high content in PTyr, and the lack of tyrosine kinase 

15 activity in vitro, suggest that GPBP is itself the target of specific tyrosine kinase(s) and 
therefore likely involved in specific signaling cascade(s). 

As discussed above, specific serine phosphorylation, as well as pre-mRNA 
alternative splicing, are associated with the biology of several autoantigens, including the 
GP antigen, acetylcholine receptor and myelin basic protein (MBP) (4). The latter is 

20 suspected to be the major antigen in multiple sclerosis (MS), another exclusively human 
autoimmune disease in which the immune system targets the white matter of the central 
nervous system. GP disease and MS are human disorders that display a strong association 
with the same HLA class II haplotype (HLA DRB1*1501)(32, 33). This, along with the 
recent report of death by GP disease of a MS patient carrying this HLA specificity (34), 

25 supports the existence of common pathogenic events in these human disorders. 

Phosphorylation of specific serines has been shown to change intracellular 
proteolysis (35-40). Conceivably, alterations in protein phosphorylation can affect 
processing and peptide presentation, and thus mediate autoimmunity. GP antigen-derived 
peptide presentation by the HLA-DR15 depends more on processing than on preferences 

30 of relatively indiscriminate DR15 molecules (41), suggesting that if processing is 
influenced by abnormal phosphorylation, the resulting peptides would likely be presented 
by this HLA. Our more recent data indicate that in both the GP and MBP systems, the 
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production of alternative splicing products serves to regulate the phosphorylation of 
specific and structurally homologous PKA sites, suggesting that this or a closely related 
kinase is the in vivo phosphorylating enzyme. Alterations in the degree of antigen 
phosphorylation, caused either by an imbalance in alternative products, or by the action of 

5 an intruding kinase that deregulates phosphorylation of the same motifs, could lead to an 
autoimmune response in predisposed individuals. rGPBP phosphorylates the human GP 
antigen at a major PKA phosphorylation site in an apparently unregulated fashion, since 
the presence of specific alternative products of the GP antigen did not affect 
phosphorylation of the primary antigen by GPBP (not shown). 

10 Although GPBP is ubiquitously expressed, in certain organs and tissues it shows a 

preference for cells and tissue structures that are target of common autoimmune responses: 
the Langerhans cells (type I diabetes); the white matter of the central nervous system 
(multiple sclerosis); the biliary ducts (primary biliary cirrhosis); the cortical cells of the 
adrenal gland (Addison disease); striated muscle cells (myasthenia gravis); 

15 spermatogonium (male infertility); Purkinje cells of the cerebellum (paraneoplasic 
cerebellar degeneration syndrome); and intestinal epithelial cells (pernicious anemia, 
autoimmune gastritis and enteritis). All the above observations point to this novel kinase 
as an attractive candidate to be considered when envisioning a model for human 
autoimmune disease. 

20 
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Example 2: GPBP Alternative Splicing 

Here we report the existence of two isoforms of GPBP that are generated by 
alternative splicing of a 78-base pair (bp) long exon that encodes a 26-residue serine- 
rich motif. Both isoforms, GPBP and GPBPA26, exist as high molecular aggregates 
5 that result from polypeptide self-aggregation. The presence of the 26-residue peptide in 
the polypeptide chain results in a molecular species that self-interacts more efficiently 
and forms aggregates with higher specific activity. Finally, we present evidences 
supporting the observation that GPBP is implicated in human autoimmune 
pathogenesis. 

10 MATERIAL AND METHODS. 
Synthetic polymers; 

Veptides. GPpepl, KGKRGDSGSPATWTTRGFVFT (SEQ ID NO:26), is described in 
Example 1. GPBPpepl, PYSRSSSMSSIDLVSASDDVHRFSSQ (SEQ ID NO:14), 
representing residues 371-396 of GPBP was synthesized by Genosys. 

15 Oligonucleotides. The following oligonucleotides were synthesized by Life 
Technologies, Inc., 5' to 3 1 : ON-GPBP-1 1m, G CGG GAC TCA GCG GCC GGA TTT 
TCT (SEQ ID NO:34); ON-GPBP-1 5m, AC AGC TGG CAG AAG AGA C (SEQ ID 
NO:35); ON-GPBP-20c, C ATG GGT AGC TTT TAA AG (SEQ ID NO; 36); ON- 
GPBP-22m, TA GAA GAA CAG TCA CAG AGT GAA AAG G (SEQ ID NO;37); 

20 ON-GPBP-53c, GAATTC GAA CAA AAT AGG CTT TC (SEQ ID NO:38); ON- 
GPBP-56m, CCC TAT AGT CGC TCT TC (SEQ ID NO:39); ON-GPBP-57c, CTG 
GGA GCT GAA TCT GT (SEQ ID NO:40); ON-GPBP-62c, GTG GTT CTG CAC 
CAT CTC TTC AAC (SEQ ID NO:41); ON-GPBP-A26, CA CAT AGA TTT GTC 
CAA AAG GTT GAA GAG ATG GTG CAG AAC (SEQ ID NO:42). 

25 Reverse transcriptase and polymerase chain rection (RT-PCR). Total RNA was 
prepared from different control and GP tissues as described in (15). Five micrograms of 
total RNA was retrotranscribed using Ready-To-Go You-Prime First-Strand beads 
(Amersham Pharmacia Biotech) and 40 pmol of ON-GPBP-53c. The corresponding 
cDNA was subjected to PCR using the pairs of primers ON-GPBP-1 lm/ON-GPBP-53c 

30 or ON-GPBP- 1 5m/ON-GPBP-62c. The identity of the products obtained with 1 5m-62c 
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was further confirmed by AIu I restriction. To specifically amplify GPBP transcripts, 
PGR was performed using primers ON-GPBP-l5m/ON-GPBP-57c. 
Northern hybridization studies. Pre-made human multiple-tissue and tumor cell-line 
Northern Blots (CLONTECH) were probed with a cDNA containing the 78-bp exon 

5 present only in GPBP or with a cDNA representing both isoforms. The corresponding 
cDNAs were obtained by PCR using the pair of primers ON-GPBP-56m and ON- 
GPBP-57c using GPBP as a template, or with primers ON-GPBP-22m and ON-GPBP- 
20c, using GPBPA26 as a template. The resulting products were random-labeled and 
hybridized following the manufacturers' instructions. 

10 Plasmid construction, expression and purification of recombinant proteins. The 
plasmid pHIL-FLAG-n4\ used for recombinant expression of FLAG-tagged GPBP in 
Pichia pastoris has been described elsewhere (4). The sequence coding for the 78-bp 
exon was deleted by site-directed mutagenesis using ON-GPBP-A26 to generate the 
plasmid pHIL-FLAG-n4'A26. Expression and affinity-purification of recombinant 

15 GPBP and GPBPA26 was done as in (4). 

Gel-filtration HPLG Samples of 250 ^1 were injected into a gel filtration PE-TSK- 
G4000SW HPLC column equilibrated with 50 mM Tris-HCl pH 7.5, 150 mM NaCl. 
The material was eluted from the column at 0.5 ml/min, monitored at 220 nm and 
minute fractions collected. 

20 In vitro phosphorylation assays. The auto-, trans-phosphorylation and in-blot 
renaturation studies were performed as in Example 1 . 

Antibodies and immunochemical techniques. Polyclonal antibodies were raised by in 
chicken against a synthetic peptide (GPBPpepl) representing the sequence coded by 
the 78-bp exon (Genosys). Egg yolks were diluted 1:10 in water, the pH adjusted to 5.0. 

25 After 6 hours at 4"C, the solution was clarified by centrifugation (25 min at 10000 x g at 
4°C) and the antibodies precipitated by adding 20 % (w/v) of sodium sulfate at 20.000 x 
g, 20*. The pellets were dissolved in PBS (1 ml per yolk) and used for 
immunohistochemical studies. The production of antibodies against GPBP/GPBPA26 
or against a3(IV)NCl domain are discussed above (see also 4, 13). 

30 Sedimentation velocity. Determination of sedimentation velocities were performed in 
an Optima XL- A analytical ultracentrifuge (Beckman Instruments Inc.), equipped with 
a VIS-UV scanner, using a Ti60 rotor and double sector cells of Epon-charcoal of 12 
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mm optical path-length. Samples of ca. 400 \xl were centrifuged at 30,000 rpm at 20°C 
and radial scans at 220 nm were taken every 5 min. The sedimentation coefficients 
were obtained from the rate of movement of the solute boundary using the program 
XLAVEL (supplied by Beckman). 

5 Sedimentation equilibrium. Sedimentation equilibrium experiments were done as 
described above for velocity experiments with samples of 70 and centrifuged at 
8,000 rpm. The experimental concentration gradients at equilibrium were analyzed 
using the program EQASSOC (Beckman) to determine the corresponding weight 
average molecular mass. A partial specific volumes of 0.71 1 cm 3 /g for GPBP and 0.729 

10 cm 3 /g for GPBPA26 were calculated from the corresponding amino acid compositions. 
Physical methods and immunochemical techniques. SDS-PAGE and Western 
blotting were performed under reducing conditions as previously described (3). 
Immunohistochemistry studies were done on formalin fixed paraffin embedded tissues 
using the ABC peroxidase method (4) or on frozen human biopsies fixed with cold 

15 acetone using standard procedures for indirect immunofluorescence. 

Two hybrid studies. Self-interaction studies were carried out in Saccharomyces 
cerevisiae (HF7c) using pGBT9 and pGAD424 (CLONTECH) to generate GAL4 
binding and activation domain-fusion proteins, respectively. Interaction was assessed 
following the manufacture's recommendations. P-galactosidase activity was assayed 

20 with X-GAL (0.75 mg/ml) for in situ and with ortho-nitrophenyl (J-D galactopyranoside 
(0.64 mg/ml) for the in-solution determinations. 

RESULTS 

Identification of two spliced GPBP variants. To characterize the GPBP 
25 species in normal human tissues, we coupled reverse transcription to a polymerase 
chain reaction (RT-PCR) on total RNA from different tissues, using specific 
oligonucleotides that flank the full open reading frame of GPBP. A single cDNA 
fragment displaying lower size than expected was obtained from skeletal muscle- 
derived RNA (Fig.8A), and from kidney, lung, skin, or adrenal gland-derived RNA (not 
30 shown). By combining nested PCR re-amplifications and endonuclease restriction 
mapping, we determined that all the RT-PCR products corresponded to the same 
molecular species (not shown). We fully sequenced the 2.2-Kb of cDNA from human 
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muscle and found it identical to HeLa-derived material except for the absence of 78- 
nucleotides (positions 1519-1596), which encode a 26-residues motif (amino acids 371- 
396) (Fig. 8B). We therefore named this more common isoform of GPBP as GPBPA26. 

To investigate whether the 78-bp represent an exon skipped transcript during 
5 pre-mRNA processing, we used this cDNA fragment to probe a human-derived 
genomic library and we isolated a -14-Kb clone. By combining Southern blot 
hybridization and PCR, the genomic clone was characterized and a contiguous DNA 
fragment of 12482-bp was fully sequenced (SEQ ID 25). The sequence contained (from 
5' to 3'), 767-bp of intron sequence, a 93-bp exon, an 818-bp intron, the 78-bp exon 
10 sequence of interest, a 9650-bp intron, a 96-bp exon and a 980-bp intron sequence (Fig. 
8C). The exon-intron boundaries determined by comparing the corresponding DNA and 
cDNA sequences meet the canonical consensus for 5' and 3' splice sites (Fig 8C) (5), 
thus confirming the exon nature of the 78-bp sequence. The GPBP gene was localized 
to chromosome 5ql3 by fluorescence in situ hybridization (FISH) using the genomic 
15 clone as a probe (not shown). 

The relative expression of GPBP in human-derived specimens was assessed by 
Northern blot analysis, using either the 78-bp exon or a 260-bp cDNA representing the 
flanking sequence of 78-bp (103-bp 5' and 157-bp 3') present in both GPBP and 
GPBPA26 (Fig. 9). The 78-bp containing the molecular species of interest were 
20 preferably expressed in striated muscle (both skeletal and heart) and brain, and poorly 
expressed in placenta, lung and liver. In contrast to GPBPA26, the GPBP was expressed 
at very low levels in kidney, pancreas and cancer cell lines. 

All the above indicates that GPBP is expressed at low levels in normal human 
tissues, and that the initial lack of detection by RT-PCR of GPBP can be attributed to a 

25 preferential amplification of the more abundant GPBPA26, Indeed, the cDNA of GPBP 
could be amplified from human tissues (skeletal muscle, lung, kidney, skin and adrenal 
gland) when the specific RT-PCR amplifications were done using 78-bp exon-specific 
oligonucleotides (not shown). This also suggests that GPBPA26 mRNA is the major 
transcript detected in Northern blot studies when using the cDNA probe representing 

30 both GPBP and GPBPA26. 
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Recombinant expression and functional characterization of GPBPA26, To 

investigate whether the absence of the 26-residue serine-rich motif would affect the 
biochemical properties of GPBP, we expressed and purified both isoforms (rGPBP and 
rGPBP A26), and assessed their auto- and trans-phosphorylation activities (Fig. 10). As 

5 reported above for rGPBP (see also 4), rGPBPA26 is purified as a single major 
polypeptide and several related minor products (Fig. 10 A). However, the number and 
relative amounts of the derived products vary compared to rGPBP, and they display M r 
on SDS-PAGE that cannot be attributed simply to the 26-residue deletion. This 
suggests that the 26-residue motif has important structural and functional consequences 

10 that could account for the reduced in-solution auto- and trans-phosphorylation activities 
displayed by rGPBPA26 (Fig.lOB). Interestingly, the differences in specific activity 
shown in the in-solution assays were not evident when autophosphorylation was 
assessed in-blot after SDS-PAGE and renaturation, suggesting that the 26-residue motif 
likely has important functional consequences at the quaternary structure level 

15 Renaturation studies further showed that phosphate transfer activities reside in the 
major polypeptides representing the proposed open reading frames, and are not 
detectable in derived minor products. 

rGPBP and rGPBP-26 exist as very active high molecular weight 
aggregates. Gel filtration analysis of affinity-purified rGPBP or rGPBP A26 yielded 

20 two chromatographic peaks (1 and II), both displaying higher MW than expected for the 
individual molecular species, as determined by SDS-PAGE studies (89 kDa and 84 
kDa, respectively ) (Fig. 11). The bulk of the recombinant material eluted as a single 
peak between the 1 58 kDa and the 669 kDa molecular weight markers (peak II), while 
limited amounts of rGPBP and only traces of rGPBP A26 eluted in peak I (>1000 kDa). 

25 Aliquots of fractions representing each chromatographic profile were subjected to SDS- 
PAGE and stained, or incubated in the presence of 32 P[y] ATP, and analyzed by 
immunoblot and autoradiography. Along with the major primary polypeptide, every 
chromatographic peak contained multiple derived products of higher or lower sizes 
indicating that the primary polypeptide associates to form high molecular weight 

30 aggregates that are stabilized by covalent and non-covalent bonds (not shown). The 
kinase activity also exhibited two peaks coinciding with the chromatographic profiles. 
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However, peak I showed a much higher specific activity than peak II, indicating that 
these high molecular weight aggregates contained a much more active form of the 
kinase. Equal volumes of rGPBP fractions numher 1 3 and 20 exhibited comparable 
phosphorylating activity, even though the protein content is approximately 20 times 

5 lower in fraction 13, as estimated by Western blot and Coomasie blue staining (Fig. 
1 1 A). The specific activities of rGPBP and rGPBPA26 at peak II are also different, and 
are consistent with the studies shown for the whole material, thus supporting the 
hypothesis that the presence of the 26-rediue serine-rich motif renders a more active 
kinase. These results also suggest that both rGPBP and rGPBPA26 exist as oligomers 

10 under native conditions, and that both high molecular weight aggregate formation and 
specific activity are greatly dependent on the presence of the 26-residue serine-rich 
motif. Analytical centrifugation analysis of rGPBP revealed that peak I contained large 
aggregates (over 10 7 Da). Peak II of rGPBP contained a homogenous population of 220 
± 10 kDa aggregates, likely representing trimers with a sedimentation coefficient of 

15 11S. Peak II of rGPBP A26 however consisted of a more heterogenous population that 
likely contains several oligomeric species. The main population (ca. 80%) displayed a 
weight average molecular mass of 310 ± 10 kDa and a coefficient of sedimentation of 
14S. 

GPBP and GPBPA26 self-interact in a yeast two-hybrid system. To assess 
20 the physiological relevance of the self-aggregation, and to determine the role of the 26- 
residue motif, we performed comparative studies using a two-hybrid interaction system 
in yeast. In this type of study, the polypeptides whose interaction is under study are 
expressed as a part of a fusion protein containing either the activation or the binding 
domains of the transcriptional factor GAL4. An effective interaction between the two 
25 fusion proteins through the polypeptide under study would result in the reconstitution 
of the transcriptional activator and the subsequent expression of the two reporter genes, 
Lac Z and His3, allowing colony color detection and growth in a His-defective 
* medium, respectively. We estimated the intensity of interactions by the growth-rate in 
histidine-defective medium, in the presence of different concentrations of a competitive 
30 inhibitor of the His3 gene product (3-AT), and a quantitative colorimetric liquid p- 
galactosidase assay. A representative experiment is presented in Fig. 12. When 
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assaying GPBPA26 for self-interaction, a significant induction of the reporter genes 
was observed, while no expression was detectable when each fusion protein was 
expressed alone or with control fusion proteins. The insertion of the 26-residue motif in 
the polypeptide to obtain GPBP resulted in a notable increase in polypeptide 
interaction. All of the above data indicate that GPBPA26 self-associates in vivo, and 
that the insertion of the 26-residues into the polypeptide chain yields a more interactive 
molecular species. 

GPBP is highly expressed in human but not in bovine and murine 
glomerulus and alveolus. We have shown that GPBP/GPBPA26 is preferentially 
expressed in human cells and tissues that are commonly targeted in naturally occurring 
autoimmune responses. To specifically investigate the expression of GPBP, we raised 
polyclonal antibodies against a synthetic peptide representing the 26-residue motif 
characteristic of this kinase isoform, and used it for immunohistochemical studies on 
frozen or formalin fixed paraffin embedded human tissues (Fig 13). In general, these 
antibodies showed more specificity than the antibodies recognizing both isoforms for 
the tissue structures that are target of autoimmune responses such as the biliary ducts, 
the Langerhans islets or the white matter of the central nervous system (not shown). 
Nevertheless, the most remarkable finding was the presence of linear deposits of 
GPBP-selective antibodies around the small vessels in every tissue studied (A), 
suggesting that GPBP is associated with endothelial basement membranes. 
Consequently, at the glomerulus, the anti-GPBP antibodies displayed a vascular pattern 
closely resembling the glomerular basement membrane staining yielded either by 
monoclonal antibodies specifically recognizing the a3(IV)NCl (compare 13B with 
13C and 13D), or by circulating GP autoantibodies (compare 13E and 13F). These 
observations further supported the initial observation that GPBP is expressed in tissue 
structures targeted in natural autoimmune responses, suggesting that the expression of 
GPBP is a risk factor and makes the host tissue vulnerable to an autoimmune attack. 

To further assess this hypothesis, we investigated the presence of GPBP and 
GPBPA26 in the glomerulus of two mammals that naturally do not undergo GP disease 
compared to human (Fig. 14). GPBP-specific antibodies failed to stain the glomerulus 
of both bovine or murine specimens (compare 14A with 14B and 14C) while antibodies 
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recognizing the N-terminal sequence common to both GPBP and GPBPA26 stained 
these structures in all three species, although with different distributions and intensities 
(14D-14F). In bovine renal cortex, GPBPA26 was expressed at a lower rate than in 
human, but showed similar tissue distribution. In murine samples, however, GPBPA26 

5 displayed a tissue distribution closely resembling that of GPBP in human glomerulus. 
Similar results were obtained when studying the alveolus in the three different species 
(not shown). To* rule out that the differences in antibody detection was due to primary 
structure differences rather than to a differential expression, we determined the 
corresponding primary structures in these two species by cDNA sequencing. Bovine 

10 and mouse GPBP (SEQ ID NOS:3-6 and 9-12) displayed an overall identity with 
human material of 97.9% and 96.6% respectively. Furthermore, the mouse 26-residue 
motif was identical to human while bovine diverged only in one residue. Finally, and 
similarly to human, wc successfully amplified GPBP cDNA from mouse or bovine 
kidney total RNA using oligonucleotides specific for the corresponding 78-bp exons, 

15 indicating that GPBP is expressed at very low levels not detectable by immunochemical 
techniques. 

GPBP is highly expressed in several autoimmune conditions. We analyzed 
several tissues from different GP patients by specific RT-PCR to assess 
GPBP/GPBPA26 mRNA levels. As in control kidneys, the major expressed isoform in 

20 GP kidneys was GPBPA26. However, in the muscle of one of the patients, GPBP was 
preferentially expressed, whereas GPBPA26 was the only isoform detected in control 
muscle samples (Fig. 15 A). Since we did not have kidney samples from this particular 
patient, we could not assess GPBP/GPBPA26 expression in the corresponding target 
organ. For similar reasons, we could not assess GPBP/GPBPA26 levels in the muscle of 

25 the patients in which kidneys were studied. Muscle cells express high levels of 
GPBP/GPBPA26 (see Northern blot in Fig. 9), and they comprise the bulk of the tissue. 
In contrast, the expression of GPBP/GPBPA26 in the kidney was much less, and the 
glomerulus was virtually the only kidney structure expressing the GPBP isoform (see 
Fig. 13). The glomerulus is a relatively less abundant structure in kidney than the 

30 myocyte is in muscle, and the glomerulus is the structure targeted by immune attack in 
GP pathogenesis. These factors, together with the preferential amplification of the more 
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abundant and shorter messages when performing RT-PCR studies, could account for 
the lack of detection of GPBP in both normal and GP kidneys, thus precluding the 
assessment of GPBP expression at the glomerulus during pathogenesis. Nevertheless, 
the increased levels of GPBP in a GP patient suggest that GPBP/GPBPA26 expression 
5 is altered during GP pathogenesis, and that augmented GPBP expression has a 
pathogenic significance in GP disease. 

To investigate the expression of GPBP and GPBPA26 in autoimnune 
pathogenesis, we studied cutaneous autoimmune processes and compared them with 
control samples representing normal skin or non-autoimmune dermatitis (Fig. 15). 

10 Control samples displayed a limited expression of GPBP in the most peripheral 
keratinocytes (15B, 15E), while keratinocytes expanding from stratum basale to 
corneum expressed abundant GPBP in skin affected by systemic lupus erythematosus 
(SLE) (15C, 1SF) or lichen planus (15D, 15G). GPBP was preferentially expressed in 
cell surface structures that closely resembled the blebs previously described in cultured 

15 keratinocytes upon UV irradiation and apoptosis induction (6), In contrast, antibodies 
recognizing both GPBP and GPBPA26 yielded a diffuse cytosolic pattern through the 
whole epidermis in both autoimmune affected or control samples (not shown). These 
data indicate that in both control and autoimmune-affected keratinocytes, GPBPA26 
was expressed at the cytosol and that the expression did not significantly vary during 

20 cell differentiation. In contrast, mature keratinocytes were virtually the only GPBP 
expressing cells. However, bleb formation and expression of GPBP was observed in the 
early stages of differentiation in epidermis affected by autoimmune responses (15C, 
15D, 15F, 15G). This further supports previous observations indicating that aberrant 
apoptosis at the basal keratinocytes is involved in the pathogenesis of autoimmune 

25 processes affecting skin (7), and suggests that apoptosis and GPBP expression are 
linked in this human cell system. 

DISCUSSION 

Alternative pre-mRNA splicing is a fundamental mechanism for differential 
gene expression that has been reported to regulate the tissue distribution, intracellular 
30 localization, and function of different protein kinases (8-11). In this regard, and closely 
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resembling GPBP, B-Raf exists as multiple spliced variants, in which the presence of 
specific exons renders more interactive, efficient and oncogenic kinases (12). 

Although it is evident that rGPBP A26 still bears the uncharacterized catalytic 
domain of this novel kinase, both auto- and trans-phosphorylating activities are greatly 

5 reduced when compared to rGPBP. Gel filtration and two hybrid experiments provide 
some insights into the mechanisms that underlie such a reduced phosphate transfer 
activity. About 1-2% of rGPBP is organized in very high molecular weight aggregates 
that display about one third of the phosphorylating activity of rGPBP, indicating that 
high molecular aggregation renders more efficient quaternary structures. Recombinant 

10 GPBPA26, with virtually no peak I material, consistently displayed a reduced kinase 
activity. However, aggregation does not seem to be the only mechanism by which the 
26-residues increases specific activity, since the rGPBPA26 material present in peak II 
also shows a reduced phosphorylating activity when compared to homologous fractions 
of rGPBP. One possibility is that rGPBP-derived aggregates display higher specific 

15 activities because of quaternary structure strengthening caused by the insertion of the 
26-residue motif. The oligomers are kept together mainly by very strong non-covalent 
bonds, since the bulk of the material appears as a single polypeptide in non-reducing 
SDS-PAGE, and the presence of either 8 M urea or 6 M guanidine had little effect on 
chromatographic gel filtration profiles (not shown). How the 26-residue motif renders a 

20 more strengthened and active structure remains to be clarified. Conformational changes 
induced by the presence of an exon encoded motif that alter the activation status of the 
kinase have been proposed for the linker domain of the Src protein (24) and exons 8b 
and 10 of B-Raf (12). Alternatively, the 26-residue motif may provide the structural 
requirements such as residues whose phosphorylation may be necessary for full 

25 activation of GPBP. 

We have reported (13) that the primary structure of the GP antigen (<x3(IV)NCl) is 
the target of a complex folding process yielding multiple conformers. Isolated conformers 
are non-minimum energy structures specifically activated by phosphorylation for 
supramolecular aggregation and likely quaternary structure formation. In GP patients, the 
30 a3(IV)NCl shows conformational alterations and a reduced ability to mediate the 
disulfide stabilization of the collagen IV network. The GP antibodies, in turn, demonstrate 



WO 00/50607 



45 



PCT/IB00/00324 



stronger affinity towards the patient a3(IV)NCl conformers, indicating that 
conformationally altered material caused the autoimmune response. Therefore, it seems 
that in GP disease an early alteration in the conforming process of the a3(IV)NCl could 
generate altered conformers for which the immune system is not tolerant, thus mediating 
5 the autoimmune response. 

Other evidence (Raya et al., unpublished results) indicates that phosphorylation is 
the signal that drives the folding of the a3(IV)NCl into non-minimum energy ends. In this 
scenario, three features of the human a3(IV)NCl system are of special pathogenic 
relevance when compared to the corresponding antigen systems from species that, like 

10 bovine or murine, do not undergo spontaneous GP disease. First, the N-terminus of the 
human a3(IV)NCl contains a motif that is phosphorylatable by PKA and also by GPBP 
(see above, and also 2-4). Second, the human gene generates multiples alternative 
products by alternative exon splicing (14,15). Exon skipping generates alternative 
products with divergent C-terminal ends that up-regulate the in vitro PKA 

15 phosphorylation of the primary a3(IV)NCl product (See below Example 3). Third, the 
human GPBP is expressed associated with glomerular and alveolar basement membranes, 
the two main targets in GP disease. The phosphorylation-dependent conforming process is 
also a feature of non-pathogenic NCI domains (13), suggesting that the phosphorylatable 
N-termimis, the alternative splicing diversification, and the expression of GPBP at the 

20 glomerular and alveolar basement membranes, are all exclusively human features that 
place the conformation process of <x3(IV)NCl in a vulnerable condition. The four 
independent GP kidneys studied expressed higher levels of GP antigen alternative 
products (15; Bernal and Saus, unpublished results), and an augmented expression of 
GPBP were found in a GP patient (see above). Both increased levels of alternative GP 

25 antigen products and GPBP are expected to have consequences in the phosphorylation- 
dependent conformational process of the a3(IV)NCl, and therefore with pathogenic 
potential. 

GPBP is highly expressed in skin targeted by natural autoimmune responses. In 
the epidermis, GPBP is associated with cell surface blebs characteristic of the 
30 apoptosis-mediated differentiation process that keratinocytes undergo during 
maturation from basale to corneum strata (22, 23). Keratinocytes from SLE patients 
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show a remarkably heightened sensitivity to UV-induced apoptosis (6, 18, 20), and 
augmented and premature apoptosis of keratinocytes has been reported to exist in SLE 
and dermatomyositis (7). Consistently, we found apoptotic bodies expanding from 
basal to peripheral strata of the epidermis in several skin autoimmune conditions 

5 including discoid lupus (not shown), SLE and lichen planus. Autoantigens, and 
modified versions thereof are clustered in the cell surface blebs of apoptotic 
keratinocytes (6,18,20). Apoptotic surface blebs present autoantigens (21), and likely 
release modified versions to the circulation (16-20). It has been suggested that the 
release of modified autoantigens from apoptotic bodies could be the immunizing event 

10 that mediates systemic autoimmune responses mediating SLE and scleroderma (18,19). 

Our evidence indicates that both GPBP and GPBPA26 are able to act in vitro as 
protein kinases, with GPBP being a more active isoform than GPBPA26. Furthermore, 
recombinant material representing GPBP or GPBPA26 purified from yeast or from 
human 293 cells contained an associated proteolytic activity that specifically degrades 

15 the ot3(IV)NCl domain (unpublished results). The proteolytic activity operates on 
a3(IV)NCl produced in an eukaryotic expression system, but not on recombinant 
material produced in bacteria (unpublished results), indicating that oc3(IV)NCl 
processing has some conformational or post-translational requirements not present in 
prokaryotic recombinant material. Finally, it has been reported that several autoantigens 

20 undergo phosphorylation and degradation in apoptotic keratinocytes (20). While not 
being limited to an exact mechanism, we propose, in light of all of the above data, that 
the machinery assembling GPBP at the apoptotic blebs likely performs a complex 
modification of the autoantigens that includes phosphorylation, conformational changes 
and degradation. Accordingly, recombinant protein representing autoantigens in SLE 

25 (PI ribosomal phosphoprotein and Sm-Dl small nuclear ribonucleoproteins) and in 
dermatomyositis (hystidil-tRNA synthetase) were in vitro substrates of GPBP 
(unpublished results). 

The down-regulation in cancer cell lines of GPBP, suggest that the cell 
machinery harboring GPBP/GPBPA26 is likely involved in signaling pathways 
30 inducing programmed cell death. The corresponding apoptotic pathway could be up 
regulated during autoimmune pathogenesis to cause an altered antigen presentation in 



WO 00/50607 



47 



PCT/IB00/00324 



individuals carrying specific MHC haplotypes; and down regulated during cell 
transformation to prevent autoimmune attack to the transformed cells during tumor 
growth. 
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Example 3. Regulation of Human Autoantigen Phosphorylation by Exon Splicing 

20 INTRODUCTION 

In GP disease, the immune system attack is mediated by autoantibodies against the 
non-collagenous C-terminal domain (NCI) of the a3 chain of collagen IV (the GP 
antigen) (1). The N-terminus of the human a3(IV)NCl contains a highly divergent and 
hydrophilic region with a unique structural motif, KRGDS 9 , that harbors a cell adhesion 

25 signal as an integral part of a functional phosphorylation site for type A protein kinases 
(2,3). Furthermore, the gene region encoding the human GP antigen characteristically 
generates multiple mRNAs by alternative exon splicing (4,5). The alternative products 
diverge in the C-terminal ends and all but one share the N-terminal KRGDS 9 (4,5). 

Multiple sclerosis (MS) is an exclusive human neurological disease characterized 

30 by the presence of inflamatory demyelization plaques at the central nervous system. (6). 
Several evidences indicate that this disease is caused by an autoimmune attack mediated 
by cytotoxic T cells towards specific components of the white matter including the myelin 
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basic protein (MBP) (7, 8). In humans, the MBP gene generates four products (MBP, 
MBPAII, MBPAV and MBPAII/V) that result from alternative exon splicing during pre- 
mRNA processing (9). Among these, MBPAII is the more abundant form in the mature 
central nervous system, while MBP form containing all the exons is virtually absent (9). 
5 Several biological similarities exist between the autoimme responses mediating 

GP disease and MS, namely: 1) both are human exclusive diseases and typically initiate 
after a viral flu-like disease; 2) a strong linkage exists to the same haplotype of the HLA- 
DR region of the class II MHC; 3) several products are generated by alternative splicing; 
and 4) the death of a MS patient by GP disease has recently been reported (10). 

10 

MATERIALS AND METHODS 

Synthetic polymers: GPAIII derived peptide, QRAHGQDLDALFVKVLRSP (SEQ 
ID NO:43) and GPAHI/TWV derived peptide, QRAHGQDLESLFHQL (SEQ ID 
NO:44) were synthesized using either Boc- (MedProbc) or Fmoc- (Chiron, Lipotec) 
15 chemistry. 

Plasmid construction and recombinant expression. 

GP derived material: The constructs representing the different GP-spliced 
forms were obtained by subcloning the cDNAs used elsewhere to express the 

20 corresponding recombinant proteins (5) into the BamHI site of a modified pETlSb 
vector, in which the extraneous vector-derived amino-terminal sequence except for the 
initiation Met was eliminated. The extra sequence was removed by cutting the vector 
with Ncol and Bam HI, filling-in of the free ends with Klenow, and re-ligation. This 
resulted in the reformation of both restriction sites and placed the BamHI site 

25 immediately downstream of the codon for the amino-terminal Met. 

The recombinant proteins representing GP or GPAV (SEQ ID NO:46) were 
purified by precipitation (5). Bacterial pellets containing the recombinant proteins 
representing GPAIII (SEQ ID NO:48) or GPAIII/IV/V (SEQ ID NO:50) were dissolved 
by 8 M urea in 40 raM Tris-HCl pH 6.8 and sonication. After centrifugation at 40,000 x 

30 g the supematants were passed through a 0.22 \*m filter and applied to resource Q 
column for FPLC. The effluent was acidified to pH 6 with HC1 and applied to a 
resource S column previously equilibrated with 40 mM MES pH 6 for a second FPLC 
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purification. The material in the resulting effluent was used for in vitro 
phosphorylation. 

MBP-derived material: cDNA representing human MBPAII (SEQ ID NO:51) 
was obtained by RT-PCR using total RNA from central nervous system. The cDNA 
representing human MBP was a generous gift from C. Campagnoni (UCLA). Both 
fragments were cloned into a modified version of pHIL-D2 (Invitrogen) containing a 
6xHis-coding sequence at the C-terminus to generate pHIL-MBPAII-His and pHIL- 
MBP-His, respectively. These plasmids were used for recombinant expression in Pichia 
pastoris as described in (12). Recombinant proteins were purified using immobilized 
metal affinity chromatography (TALON resin, CLONTECH) under denaturant 
conditions (8M urea) and eluted with 300 raM imidazole following manufacturers' 
instructions. The affinity-purified material was then renatured by dilution into 80 
volumes of 50 mM Tris-HCl pH 8.0, 10 mM CHAPS, 400 mM NaCl, 2 mM DTT, and 
concentrated 50 times by ultrafiltration through a YMlO-type membrane (AMICON). 
The Ser to Ala mutants were produced by site-directed mutagenesis over native 
sequence-containing constructs using transformer mutagenesis kit from CLONTECH 
and the resulting proteins were similarly produced. 

Phosphorylation studies. Phosphorylation studies were essentially done as 
described above (see also 3 and 12). In some experiments, the substrates were in-blot 
renatured and then, phosphorylated for 30 min at room temperature by overlaying 100 
^il of phosphorylation buffer containing 0.5 \ig of rGPBP. Digestion with V8 
endopeptidase and immunoprecipitation were performed as described in (3). 

Antibody production. Synthetic peptides representing the C-terminal divergent 
ends of GPAIII or GPAIII/IV/V comprised in SEQ ID NO:43 or SEQ ID NO:44 
respectively were conjugated to a cytochrome C, BSA or ovoalbumine using a 
glutaraldehyde coupling standard procedure. The resulting protein conjugates were 
used for mouse immmunization to obtain polyclonal antibodies specific for GPAIII and 
monoclonal antibodies specific for GPAIII/IV/V (Mabl53). To obtain monoclonal 
antibodies specific for GPAV (Mab5A) mouse were immunized using recombinant 
bacterial protein representing the corresponding alternative form comprising the SEQ 
ID NO:50. The production of monoclonal (M3/1, Pl/2) or polyclonal (anti-GPpepl) 
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antibodies against SEQ ID NO: 26 which represents the N-terminal region of the GP 
alternative forms have been previously described (3,5). 
Boc-based peptide synthesis. 

Assembling .The peptide was assembled by stepwise solid phase synthesis using 

5 a Boc-Benzyl strategy. The starting resin used was Boc-Pro-PAM resin (0.56 meq/g, 
batch R4108). The deprotection /coupling procedure used was: TFA (lxlmin) TFA (lx 
3 min) DCM (flow flash) Isopropylalcohol (lx 30 sec) DMF (3 x 1 min) 
COUPLING/DMF (1 xlO min) DMF (lxl min) COUPLING/DMF (lx 10 min) DMF 
(2x lmin) DCM (lx lmin). For each step 10 ml per gram of peptide-resin were used. 

10 The coupling of all amino acids (fivefold excess) was performed in DMF in the 
presence of BOP, Hobt and DIEA. For the synthesis the following side-chain protecting 
groups were used: benzyl for serine; 2 chlorobenzyloxycarbonyl for lysine; cyclohexyl 
for aspartic and glutamic acid; tosyl for histidine and arginine. 

Cleavage. The peptide was cleaved from the resin and fully deprotected by a 

15 treatment with liquid Hydrogen Fluoride (HF): Ten milliliters of HF per gram of 
peptide resin were added and the mixture kept at 0° C for 45 min in the presence of p- 
cresol as scavengers. After evaporation of the HF, the crude reaction mixture is washed 
with ether, dissolved in TFA, precipitated with ether and dried. 

Purification. Stationary phase: Silica CI 8, 15 ^m, 120 A; Mobile phase: solvent 

20 A: water 0. 1% TFA and solvent B: acetonitrile /A, 60/40 (v/v); Gradient: linear from 20 
to 60% B in 30 min; Flow rate: 40 ml/min; and detection was U.V (210 nm). Fractions 
with a purity higher than 80% were pooled and lyophilized. Control of purity and 
identity was performed by analytical HPLC and ES/MS. The final product had 88% 
purity and an experimental molecular weight of 2192.9. 

25 Fmoc-based peptide synthesis. 

Assembling. The peptides were synthesized by stepwise linear solid phase on 
Pro-clorotrityl-resin (0.685 meq/g) with standard Fmoc/tBu chemistry. The 
deprotection /coupling procedure used was: Fmoc aa (0.66 g) HOBt (0.26 g) DIPCDI 
(0.28 ml) for 40 min following a control by Kaiser test. If the test was positive the time 

30 was extended until change to negative. Then DMF (31 min), piperidine/DMF 20% (1 1 
min) piperidine/DMF 20% (15 min) and DMF (41 min). Side chain protectors were: 
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Pmc (pentamethylcromane sulfonyl) for arginine, Bcc (tert-butoxycarbonyl) for lysine, 
tBu (tert-butyl) for aspartic acid and for serine and Trl (trityl) for histidine. 

Cleavage. The peptide was cleaved and fully deprotected by treatment cleavage 
with TF A/water 90/10. Ten milliliters of TFA solution per gram of resin were added. 
5 Water acts as scavenger. After two hours, resin was filtered and the resulting solution 
was precipitated five times with cold diethylether. The final precipitated was dried. 

Purification. Stationary phase: Kromasil CI 8 10 jam; Mobile phase: solvent A: 
water 0. 1% TFA and solvent B: acetonitrile 0.1% TFA; Isocratic: 28% B; Flow rate: 55 
ml/min; Detection: 220 nm. Fractions with the higher purity were pooled and 
10 lyophilized, and a second HPLC purification round performed. Control of purity and 
identity was performed by analytical HPLC and ES/MS. The final product had 97% 
purity and an experimental molecular weight of 2190.9. 

RESULTS 

15 Regulation of the phosphorylation of the human GP antigen by alternative 

splicing. We produced bacterial recombinant proteins representing the primary antigen 
(GP) or the individual alternative products GPAV (SEQ ID NO:46), GPAHI (SEQ ID 
NO:48) and GPAIII/IV/V (SEQ ID NO:50), and we tested their ability to be 
phosphorylated by PKA (Figure 16, left panel ). Using standard ATP concentrations (150 

20 nM), ^ f° ur recombinant antigens were phosphorylated but to very different extents. The 
alternative forms incorporated 32 P more efficiently than the primary GP antigen, 
suggesting that they are better substrates. Because these antigens are expected to be in the 
extracellular compartment, we also assayed their phosphorylatability with more 
physiological ATP concentrations (0. 1-0.5 |aM). Under these conditions, the differences in 

25 32 P incorporation between the primary and alternative products were more evident, 
indicating that at low ATP concentrations the primary GP antigen was a very poor 
substrate for the kinase. Among the three PKA phosphorylation sites present in the GP 
antigen, the N-tenninal Ser 9 and Ser 26 are the major ones, and are common to all the 
alternative products assayed (3,5). Accordingly, the differences observed in 

30 phosphorylation for the full polypeptides also existed among the individual N-terminal 
regions, as determined after specific V8 digestion and immunoprecipitation (not shown). 
This strongly suggests that differences in phosphorylation might be due to the presence of 
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different C-terminal sequences in the alternative products. Since GPAIII and GPAIMV/V 
displayed significantly higher 32 P incorporation rates than GPAV, and they have shorter 
divergent C-terminal regions (5), we used synthetic peptides individually representing 
these C-terminal sequences (SEQ ID NO: 43, SEQ ID NO:44) to further examine their 

5 regulatory roles in the in vitro phosphorylation of the native antigen. Collagen IV is a 
trimeric molecule comprised of three interwoven a chains. In basement membranes, two 
collagen IV molecules assemble through their NCI domains to yield a hexameric NCI 
structure that can be solubilized by bacterial collagenase digestion (1). Dissociation of the 
hexamer structure releases the GP antigen in monomeric and disulfide-related dimeric 

10 forms (1). For the following set of experiments, we carried out phosphorylations in the 
presence of low, extracellular-like ATP concentrations using both monomeric or 
hexameric native GP antigen (Figure 16, right panel ). The presence of each specific 
peptide but not control peptides (not shown) induced the phosphorylation of a single 
polypeptide displaying an apparent MW of 22 kDa. By specific V8 digestion and 

15 immunoprecipitation, the corresponding polypeptide has been identified as the 22 kDa 
conformer of the a3(IV)NCl, previously characterized and identified as the best substrate 
forthePKA(ll). 

Regulation of the phosphorylation of the MBP by alternative splicing. The 

MBP contains at its N terminal region two PKA phosphorylation sites (Ser 8 , Ser 57 ) that are 
20 structurally similar to the N terminus site (Ser 9 ) present in GP antigen products (Fig 17). 
The Ser 8 site present in all the MBP proteins is located in a similar position than the Ser 9 
• in the GP-derived polypeptides. In addition, in the MBP and GPAIII Ser 8 and Ser 5 
respectively are at a similar distance in the primary structures of a highly homologous 
motif present in the corresponding exon II (bend arrow in Fig 17). The GPAIII-derived 
25 motif coincides with the C terminal divergent region that up-regulates PKA 
phosphorylation of Ser* in the GP antigen system (Fig. 16). The regulatory-like sequence 
in MBP is located at exon II and its presence in the final products depends on an 
alternative exon splicing mechanism. Therefore, the MBP motif identified by structural 
comparison to GPAIII may be also regulating PKA phosphorylation of Ser 8 . We produced 
30 recombinant proteins representing MBP and MBPAII (SEQ ID NO:54) and the 
corresponding Ser to Ala mutants to knock-out each of the two PKA phosphorylation sites 
(Ser 8 and Ser 57 ) present in exon I. Subsequently, we assessed its in vitro phosphorylation 
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by PKA (Fig. 18). MBPAII was a better substrate than MBP, and Ser 8 was the major 
phosphorylation site, indicating that, similarly to GP antigenic system, alternative exon 
splicing regulates the PKA phosphorylation of specific sites located at the N-tcrmina! 
region common to all the MBP-derived alternative forms. 

5 In similar experiments assessing GPBP phosphorylation of the recombinant MBP 

proteins, GPBP preferentially phosphorylated MBP, while little phosphorylation of 
MBPAII was observed (Fig. 19). Furthermore, recombinant Ser to Ala mutants displayed 
no significant reduction in 32 P incorporation, indicating that GPBP phosphorylates 
MBP/MBPAII in an opposite way than PKA, and that these two kinases do not share 

10 major phosphorylation sites in MBP proteins. 

From all these data we concluded that in the MBP system, alternative splicing 
regulates the phosphorylation of specific serines by either PKA or GPBP. 

Synthetic peptides representing the C terminal region of GPAIII influence 
GPBP phosphorylation. To assess the effect of the C terminal region of GPAIII on 

15 GPBP activity, peptides representing this region were synthesized using two different 
chemistries (Boc or Fmoc), and separately added to a phosphorylation mixture 
containing GPBP (Fig. 20), Boc-based synthetic peptides positively influenced GPBP 
autophosphorylation while Fmoc-based inhibited GPBP autophosphorylation, 
suggesting that the regulatory sequences derived from the alternative products in either 

20 GP and MBP antigenic systems can influence the kinase activity of GPBP. 

DISCUSSION 

We have shown that the ot3(IV)NCl domain undergoes a complex structural 
diversification by two different mechanism: 1) alternative splicing (4,5) and 2) 

25 conformational isomerization of the primary product (11). Both mechanisms generate 
products that are distinguished by PKA, indicating that PKA phosphorylation is a 
critical event in the biology of the a3(IV)NCl domain. Phosphorylation guides at least 
in part the folding, but also the supramolecular assembly of the a3(IV)NCl domain in 
the collagen IV network (1 1 and Raya et al. unpublished results). Altered conformers of 

30 the a3(IV)NCl lead the autoimmune response mediating GP disease (11), suggesting 
that an alteration in antigen phosphorylation could be the primary event in the onset of 
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the disease. Accordingly, we have found increased expression levels of GPAIII in 
several GP kidneys (4 and Bemal and Saus, unpublished results), and an increased 
expression of GPBP has been detected in another Goodpasture patient (Fig. 15). Both 
increased expression of alternative GP antigen products and of GPBP are expected to 

5 have consequences in the phosphorylation steady state of a3(IV)NCl, and therefore in 
the corresponding conformational process. The discrimination among the different 
structural products by PKA strongly suggests that this kinase, or another structurally 
similar kinase, is involved in the physiological antigen conforming process, and that 
antigen phosphorylation by GPBP has a pathogenic significance. In pathogenesis, 

10 GPBP could be an intruding kinase, interfering in the phosphorylation-dependent 
conforming process. Accordingly, GPBP is expressed in tissue structures that are 
targeted by natural autoimmune responses, and an increased expression of GPBP is 
associated with several autoimmune conditions (See examples 1 and 2 above). 

An alternative splicing mechanism also regulates the PKA phosphorylation of 

15 specific serines in the MBP antigenic system. MBP is also a substrate for GPBP 
suggesting that GPBP may play a pathogenic role in multiple sclerosis, and other 
autoimmune responses. 

All of the above data identify GPBP as a potential target for therapeutics in 
autoimmune disease. In Fig 20, we show that synthetic peptides representing the C 

20 terminal region of GPAIII (SEQ ID NO:43) modulate the action of GPBP in vitro , and 
therefore we identified this and related sequences as peptide-based compounds to 
modulate the activity of GPBP in vivo . The induction of GP antigen phosphorylation 
by PKA was achieved when using Boc-based peptides, but not when using similar 
Fmoc-based peptides. Furthermore, Boc- but not Fmoc-based peptides were in vitro 

25 substrates of PKA (not shown), indicating that important structural differences exist 
between both products. Since both products displayed no significant differences in 
mass spectrometry, one possibility is that the different deprotection procedure used may 
be responsible for conformational differences in the secondary structure that may be 
critical for biological activity. Accordingly, Boc-based peptide loses its ability to 

30 induce PKA upon long storage at low temperatures. 
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The present invention is not limited by the aforementioned particular preferred 
embodiments. It will occur to those ordinarily skilled in the art that various 
30 modifications may be made to the disclosed preferred embodiments without diverting 
from the concept of the invention. All such modifications are intended to be within the 
scope of the present invention. 
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I claim: 

L An isolated nucleic acid sequence comprising a sequence substantially similar 
to a nucleic acid sequence selected from the group consisting of SEQ ID NO:l, SEQ ID 
5 NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO:13, 
SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:i9, SEQ ID NO:21, SEQ ID NO:23, and 
SEQIDNO:25. 

2. An isolated nucleic acid sequence comprising a nucleic acid sequence selected 
10 from the group consisting of SEQ ID NO:i, SEQ ED NO:3, SEQ ID NO:5, SEQ ID 

NO:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID N0.13, SEQ ID NO:15, SEQ ID NO:17, 
SEQ ID NO: 19, SEQ ID NO:2l, SEQ ID NO:23, and SEQ ID NO:25. 

3. An isolated nucleic acid comprising a sequence that encodes a polypeptide 
15 selected from the group consisting of GPBP, GPBPA26, and GPBPpepl, or fragments 

thereof. 

4. An isolated nucleic acid sequence comprising a sequence that encodes a protein 
sequence substantially similar to a protein sequence selected from the group consisting 

20 of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO.10, SEQ ID 
NO:12, SEQ ID NO:l4, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID 
NO:22, and SEQ ID NO:24 

5. An isolated nucleic acid sequence comprising a sequence that encodes a protein * 
25 sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID 

NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ED NO:14, SEQ ID NO:16, 
SEQ ID NO:18, SEQ ED NO:20, SEQ ID NO:22, and SEQ ID NO:24. 



30 



6. A recombinant expression vector comprising the isolated nucleic acid sequence 
of any one of claims 1-5. 
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7. A recombinant expression vector comprising an isolated nucleic acid sequence 
comprising a nucleic acid sequence selected from the group consisting of SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:5> SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:l 1, SEQ 
ID NO:!3, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID 

5 NO:23, and SEQ ID NO:25, or fragments thereof 

8. A host cell transfected with the recombinant expression vector of claim 6 or 7. 

9. A substantially purified polypeptide, comprising an amino acid sequence 
10 substantially similar to a sequence selected from the group consisting of SEQ ID NO:2, 

SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO:12, SEQ ID 
N0.14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ED NO:22, SEQ ID 
NO:24, or peptide fragments thereof 

15 10. A substantially purified polypeptide, comprising an amino acid sequence 
selected from the group consisting of SEQ ED NO:2, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID 
NO: 1 8, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, or peptide fragments thereof. 

20 11. A substantially purified protein comprising a polypeptide selected from the 
group consisting of GPBP, GPBPA26, and GPBPpepl, or peptide fragments thereof 

12. An antibody that selectively binds to the substantially purified protein or 
polypeptide of any one of claims 9-11. 

25 

1 3 . The antibody of claim 12, wherein the antibody is a polyclonal antibody. 

14. The antibody of claim 12, wherein the antibody is a monoclonal antibody. 

30 15. A method for detecting the presence of a protein that is substantially similar to a 
protein selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID 
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NO:6, SEQ ID NO:8, SEQ ID NO.IO, SEQ ID N0.12, SEQ ID NO: 14, SEQ ID NO:16, 
SEQ ID NO: 1 8, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, comprising 

a) providing a protein sample to be screened; 

b) contacting the protein sample to be screened with the antibody of any 
5 one of claims 12-14 under conditions that promote antibody-antigen complex 

formation; and 

c) detecting the formation of antibody-antigen complexes, wherein the 
presence of the antibody-antigen complex indicates the presence of a protein that is 
substantially similar to a protein selected from the group consisting of SEQ ID NO:2, 

10 SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID 
NO:14, SEQ ID NO:16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID 
NO:24. 

16. The method of claim 15, wherein detecting comprises a method selected from 
15 the group consisting of immunolocalization, immunofluorescence analysis, Western 

blot analysis, ELISAs, and nucleic acid expression library screening. 

17. A method for detecting in a sample a sequence that is substantially similar to a 
nucleic acid selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ 

20 ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:l 1, SEQ ID NO:13, SEQ ID NO:15, 
SEQ ID NO: 17, SEQ ID N0.19, SEQ ID NO:21, SEQ ID NO:23, or SEQ ID NO:25, 
comprising contacting the sample with the isolated nucleic acid of any one of claims 1- 
5, or fragments thereof, and detecting complex formation, wherein complex formation 
indicates the presence in the sample of the sequence that is substantially similar to a 

25 nucleic acid selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ 
ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:l 1, SEQ ID NO:13, SEQ ID NO:15, 
SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, or SEQ ID NO:25. 

18. The method of claim 17, wherein the detecting is carried out by a method 
30 selected from the group consisting of hybridization, reverse transcription, PCR, coupled 

reverse transcription-PCR, Northern blotting, Southern blotting, and DNA library 
screening. 
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19. A method for detecting an autoimmune condition in a patient, comprising 

-providing a tissue or body fluid sample from the patient; 

-providing a control tissue or body fluid sample in which no autoimmune 
5 condition is present; and 

-detecting altered GPBP RNA or protein expression in the tissue or body fluid 
sample compared to the control sample, wherein an alteration in GPBP RNA or protein 
expression relative to the control indicates the presence of an autoimmune condition. 

10 20. A method for detecting cells undergoing apoptosis or cancer transformation in a 
tissue or body fluid sample, comprising 

-providing a tissue or body fluid sample from the patient; 
-providing a normal control tissue or body fluid sample; and 
-detecting altered GPBP RNA or protein expression in the tissue or body fluid 
15 sample compared to the control sample, wherein an alteration in GPBP RNA or protein 
expression relative to the control indicates the presence of cells undergoing apoptosis or 
cancer transformation. 

21. A method for treating a patient with an autoimmune disorder, comprising 
20 modifying the expression or activity of GPBP, GPBPA26, or a protein comprising a 

polypeptide substantially similarly to GPBPpepl in the patient with the autoimmune 
disorder. 

22. A method for treating a patient with a tumor, comprising modifying the 
25 expression or activity of GPBP, GPBPA26, or a protein comprising a polypeptide 

substantially similarly to GPBPpepl in the patient with the tumor. 

23. A method for preventing cell apoptosis, comprising modifying the expression or 
activity of GPBP, GPBPA26, or a protein comprising a polypeptide substantially 

30 similarly to GPBPpep I in the cell. 
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24. The method of claim 21, 22, or 23 wherein alternative products of the 
Goodpasture antigen or of the myelin basic protein are used to modify the expression or 
activity of GPBP, GPBPA26 or a protein comprising a polypeptide substantially 
similarly to GPBPpepl. 

5 

25. The method of claim 21, 22, or 23 wherein nucleic acids comprising sequences 
substantially similar to SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO: 
5 1, or SEQ ID NO:53 or fragments thereof are used to modify the expression or activity 
of GPBP, GPBPA26 or a protein comprising a polypeptide substantially similarly to 

10 GPBPpepl. 

26. The method of claim 21, 22, or 23 wherein polypeptides comprising sequences 
substantially similar to SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:46, SEQ ID 
NO:48, SEQ ED NO:50, or SEQ ID NO:54, or fragments thereof are used to modify the 

15 expression or activity of GPBP, GPBPA26 or a protein comprising a polypeptide 
substantially similarly to GPBPpepl . 

27. An isolated nucleic acid sequence comprising a sequence that encodes a 
polypeptide substantially similar to an amino acid sequence selected from the group 

20 consisting of SEQ ID NO:43, SEQ ID NO:44, or peptide fragments thereof. 

28. An isolated nucleic acid sequence comprising a sequence that encodes a 
polypeptide selected from the group consisting of SEQ ID NO:43, SEQ ID NO:44, and 
peptide fragments thereof. 

25 

29. A recombinant expression vector comprising the isolated nucleic acid sequence 
of claim 27 or 28. 

30. A host cell transfected with the recombinant expression vector of claim 29. 

30 
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31. A substantially purified polypeptide, comprising an amino acid sequence 
substantially similar to a sequence selected from the group consisting of SEQ ID 
NO:43, SEQ ID NO:44, or peptide fragments thereof 

5 32. A substantially purified polypeptide, comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO:43, SEQ ID NO:44, or peptide 
fragments thereof. 

33. An antibody that selectively binds to the substantially purified protein or 
10 polypeptide of claim 3 1 or 32. 

34. The antibody of claim 33, wherein the antibody is a polyclonal antibody. 

35. The antibody of claim 33, wherein the antibody is a monoclonal antibody. 

15 

36. The method of claim 21, 22, or 23 comprising administering a substantially 
purified polypeptide substantially similar to a polypeptide selected from the group 
consisting of SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ 
ID NO:50, or SEQ ID NO:54, or fragments thereof, to modify the expression or activity 

20 of GPBP, GPBPA26, or a protein comprising a polypeptide substantially similarly to 
GPBPpepl. 

37. The method of claim 21, 22, or 23 comprising administering an isolated nucleic 
acid comprising sequences substantially similar to SEQ ID NO:45, SEQ ID NO:47, 

25 SEQ ID NO:49, SEQ ID NO: 51, or SEQ ID NO:53 or fragments thereof, or fragments 
thereof, to modify the expression or activity of GPBP, GPBPA26, or a protein 
comprising a polypeptide substantially similarly to GPBPpepl. 

38. A pharmaceutical composition, comprising an amount effective of a 
30 substantially purified polypeptide substantially similar to a polypeptide selected from 

the group consisting of SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:46, SEQ ID 
NO:48, SEQ ID NO:50, or SEQ ID NO:54, or fragments thereof, to modify the 
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expression or activity of GPBP, GPBPA26, or a protein comprising a polypeptide 
substantially similarly to GPBPpepl, and a pharmaceutically acceptable carrier. 

39. A pharmaceutical composition* comprising an amount effective of a an isolated 
5 nucleic acid comprising sequences substantially similar to SEQ ID NO:45, SEQ ID 

NO:47, SEQ ID NO:49, SEQ ID NO: 51, or SEQ ID NO:53 or fragments thereof, to 
modify the expression or activity of GPBP, GPBPA26, or a protein comprising a 
polypeptide substantially similarly to GPBPpepl, and a pharmaceutically acceptable 
carrier. 

40. The method of claim 21, 22, or 23 comprising administering the pharmaceutical 
composition of claim 38 or 39 to modify the expression or activity of GPBP, 
GPBPA26, or a protein comprising a polypeptide substantially similarly to GPBPpepl. 
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GCAGGAAGATGGCGGCGGTAGCGGAGGTGTGAGTGGACGCGGGACTCAGCGGCCGGATTTTCTCTTCCCT 7 0 

TCTTTTCCCTTTTCCTTCCCTATTTGAAATTGGCATCGAGGGGGCTAAGTTCGGGTGGCAGCGCCGGGCG 14 0 

CAACGCAGGGGTCACGGCGACGGCGGCGGCGGCTGACGGCTGGAAGGGTAGGCTTCATTCACCGCTCGTC 210 

CTCCTTCCTCGCTCCGCTCGGTGTCAGGCGCGGCGGCGGCGCGGCGGGCGGACTTCGTCCCTCCTCCTGC 2 80 

TCCCCCCCACACCGGAGCGGGCACTCTTCGCTTCGCCATCCCCCGACCCTTCACCCCGAGGACTGGGCGC 3 50 

CTCCTC C GG CGCAGCTGAGGGAGCGGGGGCCGGTCTCCTGCTCGGTTGTCGAGCCTCCA TGTCGGA TAAT 4 20 

CAGAGCTGGAACTCGTCGGGCTCGGAGGAGGATCCAGAGACGGAGTCTGGGCCGCCTGTGGAGCGCTGCG 4 90 

g ja w....lf.„.s A -S 8 S JR JR !!™.BL 9 S.—.JP JR-.-.^^^^x. SL...,S 27 

GGGTCCTCAGTAAGTGGACAAACTACATTCATGGGTGGCAGGA7CGTTGGGTAGTTTTGAAAAATAATGC 5 60 

<3...y....L...A..JL..W_ 51 

TCTGAGTTACTACAAATCTGAAGATGAAACAGAGTA TGGCTGCAGAGGA TCCATCTGTCTTAGCAAGGCT 63 0 

74 

GTCArCACACCTCACGArTTTGATGAATGrCGATTTGArA TTAGTGTAAA TGATAGTGTTTGGTA TCTTC 700 

t_.p__h...d_^ 57 

gtgctcagga tccaga tca tagacagcaa tggataga tgcca ttgaacagcacaagactgaa tctggata 770 

r---a....q.„d....?...^ e s g y 121 

tgga tctgaa tccagcttgcgtcgacatggctcaa tggtgtccc7ggtgtctggagcaagtggctactct 84 0 

gsesslrrhg smvslvsgasgys 144 

gcaacatc ca cctcttcattcaagaaaggccacagtttacgtgagaagttggctgaaatggaaacattta 910 

ATSTSSFK KGHS LRE KL'AEM E T F 167 

GAGACATCTTATGTAGACAAGTTGACACGCTACAGAAGTACTTTGATGC CTGTG CTGATGCTGTCT CTAA 98 0 

RDILCRQVDTLQKYFDACADAVSK 191 

GGATGAACTTCAAAGGGATAAAGTGGTAGAAGATGATGAAGATGACTTTCCTACAACGCGTTCTGATGGT 1050 

DELQRDKVVEDDEDDPPTTRSD-G 214 

GACTTCTTGCATAGTACCAACGGCAATAAAGAAAAGTTATTTCCACATGTGACACCAAAAGGAATTAATG 112 0 

DFLHSTNGNKEKLFPHVTPKGIN 237 

GTATAGACTTTAAAGGGGAAGCGATAACTTTTAAAGCAACTACTC 1190 

GIDFKGEAITFKATTAGILATLSH 261 

TTGTATTGAACTAATGGTTAAACGTGAGGACAGCTGGCAGAAGAGACTGGATAAGGAAACTGAGAAGAAA 1260 

CIELMVKRED SWOKRLDKETEKK 284 

AGAAGAACAGAGGAAGCATATAAAAATGC AATGACAGAACTTAAGAAAAAATC CC ACTTTGGAGGAC C AG 13 3 0 

RRTSBAYKNAMTBLKXKSH? G G P 307 

ATTATGAAGAAGGCCCTAACAGTCTGATTAATGAAGAAGAGTTCTTTGATGCTGTTGAAGCTGCTCTTGA 1400 

DYEEGPNSLINEEEFFDAVEAALD 331 
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CAGACAAGATAAAATAGAAGAACAGTCACAGAGTGAAAAGGTGAGATTACATTGG C CTACATCCTTGCCC 1470 
RQDKIEEQS QSEKVRLHWPTS LP 354 

TCTGGAGATGCCTTTTCTTCTGTGGGGACACATAGATTTGTCCAAAAGCCCTATAGTCGCTCTTCCTCCA 1540 
SGDAFSSVGTHRFVQK P Y S R S S S 377 

TGTCTTCCATTGATCTAGTCAGTGCCTCTGATGATGTTCACAGATTCAGCTCCCAGGTTGAAGAGATGGT 1610 
MSSIDtiVSASDDVHRFSSO V E E M V 401 

GCAGAACCACATGACTTACTCATTACAGGATGTAGGCGGAGATGCCAATTGGCAGTTGGTTGTAGAAGAA 1680 
QNHMTYS LQDVGGDANWQLVV EE 424 

GGAGAAATGAAGGTATAGAGAAGAGAAGTAGAAGAAAATGGGATTGTTCTGGATCCTTTAAAAGCTACCC 1750 
GEMKVYRREVEENGIVIiDPLKAT 447 

ATGCAGTTT^AAGGCGTCACAGGACATGAAGTCTGCAATTATTTCTGGAATGTTGACGTTCGCAATGACTG 182 0 
HAVKGVTGHEVCNY FWNVDVRNDW 471 

GGAAACAACTATAGAAAACTTTCATGTGGTGGAAACATTAGCTGATAATGCAATCATCATTTATCAAA^ 18 90 
ETTIENFHVVETLADNAI IIYQT 494 

CACAAGAGGGTGTGGCCTGCTTCTCAGCGAGACGTATTATATCTTTCTGTCATTCGAAAGATACCAGCCT 1960 
KKRVWPASQRDVLYLSVIRKI PA 517 

TGACTGAAAATGAC CCTGAAACTTGGATAGTTTGTAATTTTTCTGTGGATCATGACAGTGCTC CTCTAAA 20 3 0 
LTENDPETWIVCNF SVDHDSAPLN 541 

CAACCGATGTGTCCGTGCCAAAATAAATGTTGCTATGATTTGTCAAACCTTGGTAAGCCCACCAGAGGGA 2100 
NRCVRAKINVAMICQTLVSPPEG 564 

AACCAGGAAATTAGCAGGGACAACATTCTATGCAAGATTACATATGTAGCTAATGTGAACCCTGGAGGAT 2170 
NQEI SRDNILCKITYVANVNPGG 587 

GGGCACCAGCCTCAGTGTTAAGGGCAGTGGCAAAGCGAGAGTATCCTAAATTTCTAAAACGTTTTACTTC 2240 
WAPASVLRAVAKREYPKFLKRFTS611 

TTACGTCCAAGAAAAAACTGCAGGAAAGCCTATTTTGTTCTAGTATTAACAGGTACTAGAAGATA7GTTT 2310 
YVQE KTAGKPILF 624 

TAT CITTTTTTAACTTTATTTGACTAATATGACTGTCAATACrAAAATTTAGTTGTTGAAAGT^ 2 3 80 

ATGTTTTTT 2 3 89 
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SEQUENCE LISTING 

<110> Saus, Juan 

<120> Goodpasture Binding Protein 

<130> 98-723-B 

<140> To Be Assigned 
<141> Filed Herewith 

<160> 54 

<170> Patentln Ver. 2.0 

<210> l 
<211> 23B9 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (409) . . (2280) 
<400> 1 

gcaggaagat ggcggcggta gcggaggtgt gagtggacgc gggactcagc ggccggattt 60 

tctcttccct tcttttccct tttccttccc tatttgaaat tggcatcgag ggggctaagt 12 0 

tcgggtggca gcgccgggcg caacgcaggg gtcacggcga cggcggcggc ggctgacggc 180 

tggaagggta ggcttcattc accgctcgtc ctccttcctc gctccgctcg gtgtcaggcg 240 

cggcggcggc gcggcgggcg gacttcgtcc ctcctcctgc tcccccccac accggagcgg 3 00 

gcactcttcg cttcgccatc ccccgaccct tcaccccgag gactgggcgc ctcctccggc 360 

gcagctgagg gagcgggggc cggtctcctg ctcggttgtc gagcctcc atg teg gat 417 

Met Ser Asp 

1 

aat cag age tgg aac teg teg ggc teg gag gag gat cca gag acg gag 465 
Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu 
5 10 15 

tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg aca aac 513 
Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
20 25 30 35 

tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat aat get 561 
Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
40 45 50 

ctg agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc aga gga 609 
Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
55 60 65 

tec ate tgt ctt age aag get gtc ate aca cct cac gat ttt gat gaa 657 
Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe Asp Glu 
70 75 80 
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tgt cga ttt gat att agt gta aat gat agt gtt tgg tat ctt cgt get 7 05 

Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
85 90 95 

cag gat cca gat cat aga cag caa tgg ata gat gec att gaa cag cac 753 

Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala He Glu Gin His 

100 105 110 115 

aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga cat ggc 8 01 

Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
120 125 130 

tea atg gtg tec ctg gtg tct gga gca agt ggc tac tct gca aca tec 84 9 

Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala Thr Ser 
135 140 145 

ace tct tea ttc aag aaa ggc cac agt tta cgt gag aag ttg get gaa 897 

Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu 
150 155 160 

atg gaa aca ttt aga gac ate tta tgt aga caa gtt gac acg eta cag 945 

Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp Thr Leu Gin 
165 170 175 



aag tac ttt gat gee tgt get gat get gtc tct aag gat gaa ctt caa 
Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin 
180 185 190 195 



993 



agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct aca acg cgt 1041 
Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg 
200 205 210 

tct gat ggt gac ttc ttg cat agt acc aac ggc aat aaa gaa aag tta 1089 
Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu 
215 220 225 

ttt cca cat gtg aca cca aaa gga att aat ggt ata gac ttt aaa ggg 1137 
Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp Phe Lys Gly 
230 235 240 

gaa gcg ata act ttt aaa gca act act get gga ate ctt gca aca ctt 1185 
Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala Thr Leu 
245 250 255 

tct cat tgt att gaa eta atg gtt aaa cgt gag gac age tgg cag aag 1233 
Ser His Cys lie Glu Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys 
260 265 270 275 

aga ctg gat aag gaa act gag aag aaa aga aga aca gag gaa gca tat 1281 
Arg Leu Asp Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu Glu Ala Tyr 
280 285 290 

aaa aat gca atg aca gaa ctt aag aaa aaa tec cac ttt gga gga cca 1329 
Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe Gly Gly Pro 
295 300 305 

gat tat gaa gaa ggc cct aac agt ctg att aat gaa gaa gag ttc ttt 1377 
Asp Tyr Glu Glu Gly Pro Asn Ser Leu lie Asn Glu Glu Glu Phe Phe 
310 315 320 

gat get gtt gaa get get ctt gac aga caa gat aaa ata gaa gaa cag 1425 
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Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys lie Glu Glu Gin 
325 330 335 

tea cag agt gaa aag gtg aga tta cat tgg cct aca tec ttg ccc net 1473 
Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser 
340 345 350 355 

gga gat gee ttt tct tct gtg ggg aca cat aga ttt gtc caa aag ccc 1521 
Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val Gin Lys Pro 
360 365 370 

tat agt cgc tct tec tec atg tct tec att gat eta gtc agt gee tct 1569 
Tyr Ser Arg Ser Ser Ser Met Ser Ser He Asp Leu Val ser Ala Ser 
375 380 * 385 

gat gat gtt cac aga ttc age tec cag gtt gaa gag atg gtg cag aac 1617 
Asp Asp Val His Arg Phe Ser Ser Gin Val Glu Glu Met Val Gin Asn 
390 395 400 

cac atg act tac tea tta cag gat gta ggc gga gat gee aat tgg cag 1665 
His Met Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin 
405 410 415 

ttg gtt gta gaa gaa gga gaa atg aag gta tac aga aga gaa gta gaa 1713 
Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu 
420 425 430 " " 435 

gaa aat ggg att gtt ctg gat cct tta aaa get ace cat gca gtt aaa 1761 
Glu Asn Gly He Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys 
440 445 450 

ggc gtc aca gga cat gaa gtc tgc aat tat ttc tgg aat gtt gac gtt 1809 
Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val 
455 460 " 465 

cgc aat gac tgg gaa aca act at a gaa aac ttt cat gtg gtg gaa aca 1857 
Arg Asn Asp Trp Glu Thr Thr He Glu Asn Phe His Val Val Glu Thr 
470 475 480 

tta get gat aat gca ate ate att tat caa aca cac aag agg gtg tgg 1905 
Leu Ala Asp Asn Ala He He He Tyr Gin Thr His Lys Arg Val Trp 
• 485 490 495 

cct get tct cag cga gac gta tta tat ctt tct gtc att cga aag ata 1953 
Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Val He Arg Lys He 
500 505 510 515 

cca gee ttg act gaa aat gac cct gaa act tgg ata gtt tgt aat ttt 2 001 
Pro Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val Cys Asn Phe 
520 525 530 

tct gtg gat cat gac agt get cct eta aac aac cga tgt gtc cgt gee 2 04 9 
Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala 
535 540 545 

aaa ata aat gtt get atg att tgt caa acc ttg gta age cca cca gag 2097 
Lys He Asn Val Ala Met He Cys Gin Thr Leu Val Ser Pro Pro Glu 
550 555 560 

gga aac cag gaa att age agg gac aac att eta tgc aag att aca tat 2145 
Gly Asn Gin Glu He Ser Arg Asp Asn He Leu Cys Lys He Thr Tyr 
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565 570 575 

gta get aat gtg aac cct gga gga tgg gca cca gec tea gtg tta agg 2193 
Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg 
580 585 590 595 

gca gtg gca aag cga gag tat cct aaa ttt eta aaa cgt ttt act tct 2241 
Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser 
600 605 610 

tac gtc caa gaa aaa act gca gga aag cct att ttg ttc tagtattaac 2290 
Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro lie Leu Phe 
615 620 

aggtactaga agatatgttt tatctttttt taactttatt tgactaatat gactgtcaat 2350 

actaaaattt agttgttgaa agtatttact atgtttttt 2389 



<210> 2 
<211> 624 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
1 5 10 15 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys 
20 25 30 

Trp Thr Asn Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 45 

Asn Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 

Cys Arg Gly Ser He Cys Leu Ser Lys Ala Val Tie Thr Pro His Asp 
65 70 75 80 

Phe Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 

Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He 
100 105 * 110 

Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 125 

Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 

Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 160 

Leu Ala Glu Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp 
165 17 0 ~ 175 

Thr Leu Gin Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 
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Glu Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 200 205 

Thr Thr Arg Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys 
210 215 220 

Glu Lys Leu Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp 
225 230 235 ' 240 

Phe Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu 
245 250 255 

Ala Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser 
260 265 270 

Trp Gin Lys Arg Leu Asp Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu 
275 280 285 

Glu Ala Tyr Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe 
290 295 300 

Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu 
305 310 315 320 

Glu Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He 
325 330 335 

Glu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser 
340 345 " 350 

Leu Pro Ser Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val 
355 360 " 365 

Gin Lys Pro Tyr Ser Arg Ser Ser Ser Met Ser Ser lie Asp Leu Val 
370 375 380 

Ser Ala Ser Asp Asp Val His Arg Phe Ser Ser Gin Val Glu Glu Met 
385 390 395 400 

Val Gin Asn His Met Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala 
405 410 415 

Asn Trp Gin Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg 
420 425 430 

Glu Val Glu Glu Asn Gly He Val Leu Asp Pro Leu Lys Ala Thr His 
435 440 445 

Ala Val Lys Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn 
450 455 460 

Val Asp Val Arg Asn Asp Trp Glu Thr Thr He Glu Asn Phe His Val 
465 470 475 480 

Val Glu Thr Leu Ala Asp Asn Ala He He He Tyr Gin Thr His Lys 
485 490 * 495 

Arg Val Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Val He 
500 505 510 



Arg Lys He Pro Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val 
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515 520 525 

Cys Asn Phe Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys 
530 535 540 

Val Arg Ala Lys lie Asn Val Ala Met lie Cys Gin Thr Leu Val Ser 
545 550 555 560 

Pro Pro Glu Gly Asn Gin Glu lie Ser Arg Asp Asn lie Leu Cys Lys 
565 570 575 

He Thr Tyr Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser 
580 585 590 

Val Leu Arg Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg 
595 600 ' 605 

Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
610 615 620 



<210> 3 

<211> 2762 

<212> DNA 

<213> Mus rausculus 

<220> 
<221> CDS 

<222> (444) . . (2315) 
<400> 3 

cgggccacca cgtgtaaata gtatcggacc cggcaggaag atggcggctg tagcggaggt 60 

gtgagtgagt ggatctgggt ctctgccgtt ggcttggctc ttcccgtctt cctcccctcc 12 0 

tccctccctg acxgaggttg gcatctaggg ggccgagttc aggtggcggc gccgggcgca 180 

gcgcaggggt cacggccacg gcggctgacg gctggaaggg caggctttct tcgccgctcg 240 

tcctccttcc ccggtccgct cggtgtcagg cgcggcggcg gcggcgcggc gggcgcgctt 300 

cgtccctctt cctgttccct cactccccgg agcgggctct cttggcggtg ccatcccccg 360 

acccttcacc ccagggacta ggcgcctgca ctggcgcagc tcgcggagcg ggggccggtc 420 

tcctgctcgg ctgtcgcgtc tec atg teg gat aac cag age tgg aac teg teg 473 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser 
15 10 

ggc teg gag gag gat ccg gag acg gag tec ggg ccg cct gtg gag cgc 521 
Gly Ser Glu Glu Asp Pro Glu Thr Glu Ser Gly Pro Pro Val Glu Arg 
15 20 25 

tgc ggg gtc etc age aag tgg aca aac tat att cat gga tgg cag gat 569 
Cys Gly Val Leu Ser Lys Trp Thr Asn Tyr He His Gly Trp Gin Asp 
30 35 40 

cgt tgg gta gtt ttg aaa aat aat act ttg agt tac tac aaa tct gaa 617 
Arg Trp Val Val Leu Lys Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu 
45 50 55 
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gat gaa aca gaa tat ggc tgt agg gga tec ate tgt ctt age aag get 665 

Asp Glu Thr Glu Tyr Gly Cys Arg Gly Ser lie Cys Leu Ser Lys Ala 

60 65 70 

gtg ate acg cct cac gat ttt gat gaa tgc egg ttt gat ate agt gta 713 

Val He Thr Pro His Asp Phe Asp Glu Cys Arg Phe Asp He Ser Val 

75 80 85 90 

aat gat agt gtt tgg tac ctt cga get cag gac ccg gag cac aga cag 761 

Asn Asp Ser Val Trp Tyr Leu Arg Ala Gin Asp Pro Glu His Arg Gin 

95 100 105 



caa tgg gta gac .gee att gaa cag cac aag act gaa teg gga tat gga 
Gin Trp Val Asp Ala He Glu Gin His Lys Thr Glu Ser Gly Tyr Gly 
110 115 120 



809 



tct gag tec age ttg cgt aga cat ggc tea atg gtg tea ctg gtg tct 857 
Ser Glu Ser Ser Leu Arg Arg His Gly Ser Met Val Ser Leu Val Ser 
125 130 135 

gga gcg agt ggc tat tct get acg tec acc tct tct ttc aag aaa ggc 90 5 
Gly Ala Ser Gly Tyr Ser Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly 
140 145 150 

cac agt tta cgt gag aaa ctg get gaa atg gag aca ttt egg gac ate 95 3 
His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr Phe Arg Asp lie 
155 160 165 170 

ctg tgc egg cag gtt gat act etc cag aag tac ttt gat gtc tgt get 1001 
Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe Asp Val Cys Ala 
175 180 185 

gac get gtc tec aag gat gag ctt cag agg gat aaa gtc gta gaa gat 1049 
Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys Val Val Glu Asp 
190 195 200 

gat gaa gat gac ttc cct aca act cgt tct gat gga gac ttt ttg cac 1097 
Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly Asp Phe Leu His 
205 210 215 

aat acc aat ggt aat aaa gaa aaa tta ttt cca cat gta aca cca aaa 1145 
Asn Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His Val Thr Pro Lys 
220 225 230 

gga att aat ggc ata gac ttt aaa ggg gaa gca ata act ttt aaa gca 1193 
Gly He Asn Gly He Asp Phe Lys Gly Glu Ala He Thr Phe Lys Ala 
235 240 245 250 

act act get gga ate ctt get aca ctt tct cat tgt att gaa tta atg 1241 
Thr Thr Ala Gly He Leu Ala Thr Leu Ser His Cys He Glu Leu Met 
255 260 265 

gta aaa egg gaa gag age tgg caa aaa aga cac gat agg gaa gtg gaa 12 8 9 
Val Lys Arg Glu Glu Ser Trp Gin Lys Arg His Asp Arg Glu Val Glu 
270 275 280 

aag agg aga cga gtg gag gaa gcg tac aag aat gtg atg gaa gaa ctt 133 7 
Lys Arg Arg Arg Val Glu Glu Ala Tyr Lys Asn Val Met Glu Glu Leu 
285 ^ 290 295 

aag aag aaa ccc cgt ttc gga ggg ccg gat tat gaa gaa ggt cca aac 13 8 5 
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Lys Lys Lys Pro Arg Phe Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn 
300 305 310 

agt ctg att aat gag gaa gag ttc ttt gat get gtt gaa get get ctt 1433 

Ser Leu He Asn Glu Glu Glu Phe Phe Asp Ala Val Glu Ala Ala Leu 

315 320 * 325 330 

gac aga caa gat aaa ata gag gaa cag tea cag agt gaa aag gtc agg 14 81 

Asp Arg Gin Asp Lys He Glu Glu Gin Ser Gin Ser Glu Lys Val Arg 
335 340 345 

tta cac tgg ccc aca tea ttg cca tct gga gac ace ttt tct tct gtc 1529 

Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Thr Phe Ser Ser Val 
350 355 * 360 

ggg acg cat aga ttt gta caa aag ccc tat agt cgc tct tec tec atg 1577 

Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser Arg Ser Ser Ser Met 
365 370 375 

tct tec att gat eta gtc agt gee tct gac gat gtt cac aga ttc age 1625 

Ser Ser He Asp Leu Val Ser Ala Ser Asp Asp Vai His Arg Phe Ser 
380 385 " 390 

tec cag gtt gaa gaa atg gta cag aac cac atg aac tat tea tta cag 1673 

Ser Gin Val Glu Glu Met Val Gin Asn His Met Asn Tyr Ser Leu Gin 

395 400 405 410 

gat gta ggt ggt gat gca aat tgg caa ctg gtt gtt gaa gaa gga gaa 1721 

Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
415 420 425 

atg aag gta tac aga aga gaa gtg gaa gaa aat gga att gtt ctg gat 1769 

Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly He Val Leu Asp 
430 435 * 440 

cct ttg aaa get act cat gca gtt aaa ggt gtt aca gga cat gag gtc 1817 

Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
445 450 * 455 

tgc aat tac ttt tgg aat gtt gat gtt cgc aat gac tgg gaa act act 1865 

Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
460 465 470 

ata gaa aac ttt cat gtg gtg gaa aca tta get gat aat gca ate ate 1913 

He Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala He He 

475 480 485 490 

gtt tat caa acg cac aag aga gta tgg ccc get tct cag aga gac gta 1961 

val Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
495 500 505 

ctg tat ctt tct get att cga aag ate cca gee ttg act gaa aat gat 2009 

Leu Tyr Leu Ser Ala He Arg Lys He Pro Ala Leu Thr Glu Asn Asp 
510 515 520 

cct gaa act tgg ata gtt tgt aat ttt tct gtg gat cat gat agt get 2057 

Pro Glu Thr Trp He Val Cys Asn Phe Ser Val Asp His Asp Ser Ala 
525 530 535 

cct ctg aac aat cga tgt gtc cgt gee aaa ate aat att get atg att 2105 

Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He Asn He Ala Met He 
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540 545 550 

tgt caa act tta gta age cca cca gag gga gac cag gag ata age aga 2153 
Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asp Gin Glu lie Ser Arg 
555 560 565 570 

gac aac att ctg tgc aag ate acg tat gta get aat gtg aac cca gga 22 01 
Asp Asn lie Leu Cys Lys He Thr Tyr Val Ala Asn Val Asn Pro Gly 
575 580 585 

gga tgg gcg cca get teg gtc tta aga gca gtg gca aag cga gaa tac 2249 
Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
590 595 600 

cct aag ttt eta aaa cgt ttt act tct tat gtc caa gaa aaa act gca 2297 
Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
605 610 615 

gga aaa cca att ttg ttt tagtattaac agtgactgaa geaaggctge 2345 
Gly Lys Pro He Leu Phe 
620 



gtgacgttcc 


atgttggaga 


aaggagggaa 


aaaataaaaa 


gaatcctcta 


agctggaacg 


2405 


taggatctac 


agecttgtet 


gtggcccaag 


aagaaacatt 


geaategtaa 


agctgggtat 


2465 


ccagcactag 


ccatctcctg 


ctaggcctcc 


tcgctcagcg 


cgtaactata 


aatacatgta 


2525 


gaatcacatg 


gatatggcta 


tatttttatt 


tgcttgctcc 


ttggagtgaa 


aacaaataac 


2585 


tttgaattac 


aactaggaat 


taaccgatgc 


tttaattttg 


aggaactttt 


tcagaatttt 


2645 


ttatttacca 


tggtccaacc 


taagatcctc 


agttgtatca 


agtttttgtg 


cacaaaagaa 


2705 


aagcacaaaa 


gttgaacgea 


cctgaaggca 


tgtgctctct 


gtgcaacaaa 


tactcag 


2762 



<210> 4 
<211> 624 
<212> PRT 

<213> Mus musculus 
<400> 4 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
1 5 10 15 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys 
20 25 30 

Trp Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 45 

Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 

Cys Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp 
65 70 75 80 

Phe Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 
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Leu Arg Ala Gin Asp Pro Glu His Arg Gin Gin Trp Val Asp Ala He 
100 105 110 

Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 125 

Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 

Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 ~ 160 

Leu Ala Glu Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp 
165 170 175 

Thr Leu Gin Lys Tyr Phe Asp Val Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 

Glu Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 200 205 

Thr Thr Arg Ser Asp Gly Asp Phe Leu His Asn Thr Asn Gly Asn Lys 
210 215 220 

Glu Lys Leu Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp 
225 230 235 240 

Phe Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu 
245 250 255 

Ala Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Glu Ser 
260 265 270 

Trp Gin Lys Arg His Asp Arg Glu Val Glu Lys Arg Arg Arg Val Glu 
275 280 285 

Glu Ala Tyr Lys Asn Val Met Glu Glu Leu Lys Lys Lys Pro Arg Phe 
290 295 300 

Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu 
305 310 315 320 

Glu Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He 
325 330 335 

Glu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser 
340 345 350 

Leu Pro Ser Gly Asp Thr Phe Ser Ser Val Gly Thr His Arg Phe Val 
355 360 365 

Gin Lys Pro Tyr Ser Arg Ser Ser Ser Met Ser Ser He Asp Leu Val 
370 375 380 

Ser Ala Ser Asp Asp Val His Arg Phe Ser Ser Gin Val Glu Glu Met 
385 390 395 400 

Val Gin Asn His Met Asn Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala 
405 410 415 



Asn Trp Gin Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg 
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420 



425 



430 



Glu Val Glu Glu Asn Gly lie Val Leu Asp Pro Leu Lys Ala Thr His 
435 440 445 

Ala Val Lys Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn 
450 455 460 

Val Asp Val Arg Asn Asp Trp Glu Thr Thr lie Glu Asn Phe His Val 
465 470 475 480 

Val Glu Thr Leu Ala Asp Asn Ala lie He Val Tyr Gin Thr His Lys 
485 490 495 

Arg Val Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Ala He 
500 505 510 

Arg Lys He Pro Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val 
515 520 525 

Cys Asn Phe Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys 
530 535 540 

Val Arg Ala Lys He Asn He Ala Met He Cys Gin Thr Leu Val Ser 
545 550 555 560 

Pro Pro Glu Gly Asp Gin Glu lie Ser Arg Asp Asn He Leu Cys Lys 
565 570 575 

He Thr Tyr Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser 
580 585 590 

Val Leu Arg Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg 
595 600 605 



Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
610 615 620 



<210> 5 
<211> 2361 
<212> DNA 
<213> Bos taurus 

<220> 
<221> CDS 

<222> (421) . . (2292) 
<400> 5 

cggcaggaag atggcggcct agcggaggtg tgagtggacc tgggtctctg cagctgggtt 60 
ttccctcttc ccgtctttct cctcttttcc tctcccccga ggttggcatc gagggggcca 120 
aattcgggcg gcggcgccgg gcgcagcgca ggggtcacaa cgacggcgac ggctgacggt 180 
tggaagggca ggcttccttc gcccctcgac ctccttcccc ggtccgcttg gtgtcaggcg 240 
cggcggcggc ggcggcggcg gcgcggcggg cggactccat ccctcctccc gctccctcct 300 
gcaccggagc gggcactcct tccttcgcca tcccccgacc cttcaccccg gggactgggc 360 
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gcctccaccg gcgcagctca gggagcgggg gccggtctcc tgctcggctg tcgcgcctcc 420 

atg teg gat aac cag age tgg aac teg teg ggc teg gag gag gat ccg 468 
Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
15 10 IS 

gag acg gag tec ggg ccg ccg gtg gag cgc tgc gga gtc etc aac aag 516 
Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Asn Lys 
20 25 30 

tgg aca aac tat att cat ggg tgg cag gat cgc tgg gta gtt ttg aaa 5 64 
Trp Thr Asn Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 45 

aat aac act ctg agt tac tac aaa tct gaa gat gag aca gag tat ggc 612 
Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 

tgc aga gga tec ate tgt ctt age aag get gtc ate acg cct cat gat 660 
Cys Arg Gly Ser lie Cys Leu Ser Lys Ala Val He Thr Pro His Asp 
65 70 75 80 

ttt gat gaa tgc cga ttt gat att agt gta aat gat agt gtt tgg tat 708 
Phe Asp Glu Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 

ctt cgt get caa gat cca gat cac aga cag cag tgg ata gat gee att 756 
Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He 
100 105 110 

gaa cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt 8 04 
Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 125 

cga cat ggc tec atg gta tea ttg gta tec gga gca agt ggc tat tct 852 
Arg Hi3 Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 

gca aca tec ace tec tea ttc aag aag ggc cac agt tta cgt gag aaa 900 
Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 160 

ctg get gaa atg gaa acc ttt aga gat ata ctg tgt aga caa gtt gat 94 8 
Leu Ala Glu Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp 
165 170 175 

acc eta cag aag ttc ttt gat gee tgt get gat get gtc tec aag gat 996 
Thr Leu Gin Lys Phe Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 

gaa ttt caa agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct 1044 
Glu Phe Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 200 205 

acg aca cgt tct gat gga gac ttc ttg cat aat acc aat ggc aat aag 10 92 
Thr Thr Arg Ser Asp Gly Asp Phe Leu His Asn Thr Asn Gly Asn Lys 
210 215 220 

gaa aag gta ttt cca cat gta aca cca aaa gga att aat ggt ata gac 1140 
Glu Lys Val Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp 
225 230 235 240 
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ttt aaa ggt gag gcg ata act ttt aaa gca act act gcc gga ate ctt 1188 
Phe Lys Gly Glu Ala lie Thr Phe Lys Ala Thr Thr Ala Gly He Leu 
245 250 255 

get aca ctt tct cat tgt att gag ctg atg gta aaa cgt gag gac age 123 6 
Ala Thr Leu Ser His Cys lie Glu Leu Met' Val Lys Arg Glu Asp Ser 
260 265 270 

tgg caa aag aga atg gac aag gaa act gag aag aga aga aga gtg gag 12 84 
Trp Gin Lys Arg Met Asp Lys Glu Thr Glu Lys Arg Arg Arg Val Glu 
275 280 285, 

gaa gca tac aaa aat gcc atg aca gaa ctt aag aaa aaa tec cac ttt 1332 
Glu Ala Tyr Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe 
290 295 300 

gga gga cca gat tat gag gaa ggc cca aac agt ttg att aat gaa gag 1380 
Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu 
305 310 315 320 

gag ttc ttt gat get gtt gaa get get ctt gac aga caa gat aaa ata 142 8 
Glu Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He 
325 330 ' " 335 

gaa gaa cag teg cag agt gaa aag gtc agg tta cat tgg tct act tea 1476 
Glu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Ser Thr Ser 
340 345 350 

atg cca tct gga gat gcc ttt tct tct gtg ggg act cat aga ttt gtc 1524 
Met Pro Ser Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val 
355 360 1 365 

caa aag ccc tat agt cgc tct tec tec atg tct tec att gat eta gtc 1572 
Gin Lys Pro Tyr Ser Arg Ser Ser Ser Met Ser Ser He Asp Leu Val 
370 375 380 

agt gcc tct gac ggt gtt cac aga ttc age tec cag gtt gaa gag atg 1620 
Ser Ala Ser Asp Gly Val His Arg Phe Ser Ser Gin Val Glu Glu Met 
385 390 395 400 

gtg cag aac cac atg ace tat tea ttg cag gat gta ggt ggg gac gcc 1668 
Val Gin Asn His Met Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala 
405 410 415 

aac tgg cag ttg gtt gta gaa gaa ggg gag atg aag gta tat aga aga 1716 
Asn Trp Gin Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg 
420 425 * 430 

gaa gta gaa gaa aat ggg att gtt ctg gat cct ttg aaa get acc cat 1764 
Glu Val Glu Glu Asn Gly He Val Leu Asp Pro Leu Lys Ala Thr His 
435 440 445 

gca gtt aaa ggc gtt aca gga cac gag gtc tgc aat tac ttc tgg aat 1812 
Ala val Lys Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn 
450 455 460 

gtt gat gtt cgc aat gat tgg gaa aca act ata gaa aac ttt cat gtg 1860 
Val Asp Val Arg Asn Asp Trp Glu Thr Thr He Glu Asn Phe His Val 
465 470 475 480 
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gtg gaa aca tta get gat aat gca ate ate att tat caa acg cac aag 1908 
Val Glu Thr Ijeu Ala Asp Asn Ala lie He He Tyr Gin Thr Kis Lys 
485 490 495 

aga gtg tgg cca gee tct cag egg gat gtc tta tat ctg tct gee att 1956 
Arg Val Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Ala He 
500 505 510 

cga aag ata cca get ttg aat gaa aat gac ccg gag act tgg ata gtt 2004 
Arg Lys lie Pro Ala Leu Asn Glu Asn Asp Pro Glu Thr Trp He Val 
515 520 525 

tgt aat ttt tct gta gat cac age agt get cct eta aac aat cga tgt 2052 
Cys Asn Phe Ser val Asp His Ser Ser Ala Pro Leu Asn Asn Arg Cys 
530 535 540 

gtc cgt gec aaa ata aac gtt get atg att tgt cag acc ttg gtg age 2100 
Val Arg Ala Lys He Asn Val Ala Met He Cys Gin Thr Leu Val Ser 
545 550 555 560 

ccc cca gag gga aac cag gag att age agg gac aac att eta tgc aag 2148 
Pro Pro Glu Gly Asn Gin Glu He Ser Arg Asp Asn He Leu Cys Lys 
565 570 575 

att aca tac gtg gee aat gta aac cct gga gga tgg gec cca gec tea 2196 
He Thr Tyr Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser 
580 585 590 

gtg tta egg gca gtg gca aag cga gaa tat cca aag ttt eta aag cgt 2244 
Val Leu Arg Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg 
595 600 605 

ttt act tct tac gta caa gaa aaa act gca gga aaa cct att ttg ttc 22 92 
Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro lie Leu Phe 
610 615 620 

tagtattaac agtgactgaa gcaaggctgt gtgacattcc atgttggagg aaaaaaaaaa 23 52 

aaaaaaaaa 23 61 



<21Q> 6 

<211> 624 

<212> PRT 

<213> Bos taurus 

<400> 6 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
15 10 15 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Asn Lys 
20 25 30 

Trp Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 45 

Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 

Cys Arg Gly Ser He Cys Leu Ser Lys Ala Val lie Thr Pro His Asp 
65 70 75 80 
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Phe Asp Glu Cys Arg Phe Asp lie Ser Val Asn Asp Ser 
85 90 



Val Trp Tyr 
95 



Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala lie 
100 105 no 

Glu Gin Kis Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 125 

Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 

Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 ~ 160 

Leu Ala Glu Met Glu Thr Phe Arg Asp lie Leu Cys Arg Gin Val Asp 
165 170 " " 175 

Thr Leu Gin Lys Phe Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 

Glu Phe Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 200 205 

Thr Thr Arg Ser Asp Gly Asp Phe Leu His Asn Thr Asn Gly Asn Lys 
210 215 220 

Glu Lys Val Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp 
225 230 235 240 

Phe Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu 
245 250 255 

Ala Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser 
260 265 270 

Trp Gin Lys Arg Met Asp Lys Glu Thr Glu Lys Arg Arg Arg Val Glu 
275 280 285 

Glu Ala Tyr Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe 
290 295 300 

Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu 
305 310 315 320 

Glu Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys lie 
325 330 335 

Glu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Ser Thr Ser 
340 345 350 

Met Pro Ser Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val 
355 360 365 

Gin Lys Pro Tyr Ser Arg Ser Ser Ser Met Ser Ser He Asp Leu Val 
370 375 380 

Ser Ala Ser Asp Gly Val His Arg Phe Ser Ser Gin Val Glu Glu Met 
385 390 395 400 
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Val Gin Asn His Met Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala 
405 410 415 

Asn Trp Gin Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg 
420 425 " 430 

Glu Val Glu Glu Asn Gly He Val Leu Asp Pro Leu Lys Ala Thr His 
435 440 445 

Ala Val Lys Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn 
450 455 460 

Val Asp Val Arg Asn Asp Trp Glu Thr Thr He Glu Asn Phe His Val 
465 470 475 480 

Val Glu Thr Leu Ala Asp Asn Ala He He He Tyr Gin Thr His Lys 
485 490 495 

Arg Val Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Ala He 
500 505 510 

Arg Lys He Pro Ala Leu Asn Glu Asn Asp Pro Glu Thr Trp He Val 
515 520 525 

Cys Asn Phe Ser Val Asp His Ser Ser Ala Pro Leu Asn Asn Arg Cys 
530 535 540 

Val Arg Ala Lys He Asn Val Ala Met He Cys Gin Thr Leu Val Ser 
545 550 555 560 

Pro Pro Glu Gly Asn Gin Glu He Ser Arg Asp Asn He Leu Cys Lys 
565 570 575 

He Thr Tyr Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser 
580 585 590 

Val Leu Arg Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg 
595 600 605 

Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
610 615 620 



<210> 7 

<211> 2187 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (391) . . (2187) 
<400> 7 

tagcggaggt gtgagtggac gcgggactca gcggccggat tttctcttcc cttcttttcc 60 
cttttccttc cctatttgaa attggcatcg agggggctaa gttcgggtgg cagcgccggg 120 
cgcaacgcag gggtcacggc gacggcggcg gcggctgacg gctggaaggg taggcttcat 180 
tcaccgctcg tcctccttcc tcgctccgct cggtgtcagg cgcggcggcg gcgcggcggg 24 0 
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cggacttcgt ccctcctcct gctccccccc acaccggagc gggcactctt cgcttcgcca 300 

tcccccgacc cttcaccccg aggactgggc gcctcctccg gcgcagctga gggagcgggg 360 

gccggtctcc tgctcggttg tcgagcctcc atg teg gat aat cag age tgg aac 414 

Met Ser Asp Asn Gin Ser Trp Asn 

1 5 

teg teg ggc teg gag gag gat cca gag acg gag tct ggg ccg cct gtg 4 62 
Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu Ser Gly Pro Pro Val 
10 15 20 

gag cgc tgc ggg gtc etc agt aag tgg aca aac tac att cat ggg tgg 510 
Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn Tyr lie His Gly Trp 
25 30 35 4Q 

cag gat cgt tgg gta gtt ttg aaa aat aat get ctg agt tac tac aaa 558 
Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala Leu Ser Tyr Tyr Lys 
45 50 * 55 

tct gaa gat gaa aca gag tat ggc tgc aga gga tec ate tgt ctt age 606 
Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly Ser He Cys Leu Ser 
60 65 70 

aag get gtc ate aca cct cac gat ttt gat gaa tgt cga ttt gat att 654 
Lys Ala Val He Thr Pro His Asp Phe Asp Glu Cys Arg Phe Asp He 
75 80 85 

agt gta aat gat agt gtt tgg tat ctt cgt get cag gat cca gat cat 702 
Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala Gin Asp Pro Asp His 
90 95 100 

aga cag caa tgg ata gat gec att gaa cag cac aag act gaa tct gga 750 
Arg Gin Gin Trp He Asp Ala He Glu Gin His Lys Thr Glu Ser Gly 
105 110 H5 120 

tat gga tct gaa tec age ttg cgt cga cat ggc tea atg gtg tec ctg 798 
Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly Ser Met Val Ser Leu 
125 130 135 

gtg tct gga gca agt ggc tac tct gca aca tec acc tct tea ttc aag 846 
■Val Ser Gly Ala Ser Gly Tyr Ser Ala Thr Ser Thr Ser Ser Phe Lys 
140 145 150 

aaa ggc cac agt tta cgt gag aag ttg get gaa atg gaa aca ttt aga 894 
Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr Phe Arg 
155 160 165 

gac ate .tta tgt aga caa gtt gac acg eta cag aag tac ttt gat gec 942 
Asp He Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe Asp Ala 
170 175 180 

tgt get gat get gtc tct aag gat gaa ctt caa agg gat aaa gtg gta 990 
Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys Val Val 
185 190 195 " 200 

gaa gat gat gaa gat gac ttt cct aca acg cgt tct gat ggt gac ttc 1038 
Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly Asp Phe 
205 210 215 

ttg cat agt acc aac ggc aat aaa gaa aag tta ttt cca cat gtg aca 1086 
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Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His Val Thr 
220 225 230 

cca aaa gga att aat ggt ata gac ttt aaa ggg gaa gcg ata act ttt 1134 
Pro Lys Gly lie Asn Gly lie Asp Phe Lys Gly Glu Ala lie Thr Phe 
235 240 245 

aaa gca act act get gga ate ctt gca aca ctt tct cat tgt att gaa 1182 
Lys Ala Thr Thr Ala Gly lie Leu Ala Thr Leu Ser His Cys lie Glu 
250 255 260 

eta atg gtt aaa cgt gag gac age tgg cag aag aga ctg gat aag gaa 1230 
Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys Arg Leu Asp Lys Glu 
265 270 275 280 

act gag aag aaa aga aga aca gag gaa gca tat aaa aat gca atg aca 1278 
Thr Glu Lys Lys Arg Arg Thr Glu Glu Ala Tyr Lys Asn Ala Met Thr 
285 290 295 

gaa ctt aag aaa aaa tec cac ttt gga gga cca gat tat gaa gaa ggc 1326 
Glu Leu Lys Lys Lys Ser His Phe Gly Gly Pro Asp Tyr Glu Glu Gly 
300 305 " 310 

cct aac agt ctg att aat gaa gaa gag ttc ttt gat get gtt gaa get 13 74 
Pro Asn Ser Leu lie Asn Glu Glu Glu Phe Phe Asp Ala Val Glu Ala 
315 320 325 

get ctt gac aga caa gat aaa ata gaa gaa cag tea cag agt gaa aag 1422 
Ala Leu Asp Arg Gin Asp Lys lie Glu Glu Gin Ser Gin Ser Glu Lys 
330 335 340 

gtg aga tta cat tgg cct aca tec ttg ccc tct gga gat gee ttt tct 1470 
Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala Phe Ser 
345 350 355 360 

tct gtg ggg aca cat aga ttt gtc caa aag gtt gaa gag atg gtg cag 1518 
Ser Val Gly Thr His Arg Phe Val Gin Lys Val Glu Glu Met Val Gin 
365 370 375 

aac cac atg act tac tea tta cag gat gta ggc gga gat gec aat tgg 1566 
Asn His Met Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp 
380 385 390 

cag ttg gtt gta gaa gaa gga gaa atg aag gta tac aga aga gaa gta 1614 
Gin Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val 
395 400 405 

gaa gaa aat ggg att gtt ctg gat cct tta aaa get acc cat gca gtt 1662 
Glu Glu Asn Gly He Val Leu Asp Pro Leu Lys Ala Thr His Ala Val 
410 415 420 

aaa ggc gtc aca gga cat gaa gtc tgc aat tat ttc tgg aat gtt gac 1710 
Lys Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp 
425 430 435 440 

gtt cgc aat gac tgg gaa aca act ata gaa aac ttt cat gtg gtg gaa 1758 
Val Arg Asn Asp Trp Glu Thr Thr He Glu Asn Phe His Val Val Glu 
445 450 455 

aca tta get gat aat gca ate ate att tat caa aca cac aag agg gtg 1806 
Thr Leu Ala Asp Asn Ala He He He Tyr Gin Thr His Lys Arg Val 
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460 465 470 

tgg cct get tct cag cga gac gta tta tat ctt tct gtc att cga aag 1854 

Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Val He Arg Lys 
475 480 485 

ata cca gec ttg act gaa aat gac cct gaa act tgg ata gtt tgt aat 1902 

He Pro Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val Cys Asn 
490 495 500 

ttt tct gtg gat cat gac agt get cct eta aac aac cga tgt gtc cgt 1950 

Phe Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg 

505 510 515 4 520 

gec aaa ata aat gtt get atg att tgt caa acc ttg gta age cca cca 1998 

Ala Lys He Asn Val Ala Met lie Cys Gin Thr Leu Val Ser Pro Pro 

525 530 535 

gag gga aac cag gaa att age agg gac aac att eta tgc aag att aca 2046 

Glu Gly Asn Gin Glu He Ser Arg Asp Asn He Leu Cys Lys He Thr 

540 545 S50 

tat gta get aat gtg aac cct gga gga tgg gca cca gec tea gtg tta 2094 

Tyr Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu 
555 560 565 

agg gca gtg gca aag cga gag tat cct aaa ttt eta aaa cgt ttt act 2142 

Arg Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr 
570 575 580 

tct tac gtc caa gaa aaa act gca gga aag cct att ttg ttc tag 218 7 

Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
585 590 595 



<210> 8 
<211> 598 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
1 5 10 15 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys 
20 25 3 0 

Trp Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 " 45 

Asn Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 4 60 

Cys Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp 
65 70 75 80 

Phe Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 

Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He 
100 105 HO 
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Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 * 125 

Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 

Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 160 

Leu Ala Glu Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp 
165 170 ~ 175 

Thr Leu Gin Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 

Glu Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 200 ' 205 

Thr Thr Arg Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys 
210 215 220 

Glu Lys Leu Phe Pro His Val Thr Pro Lys Gly lie Asn Gly He Asp 
225 230 235 240 

Phe Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu 
245 250 255 

Ala Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser 
260 265 270 

Trp Gin Lys Arg Leu Asp Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu 
275 280 285 

Glu Ala Tyr Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe 
290 295 300 

Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu 
305 310 315 320 

Glu Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He 
325 330 " " 335 

Glu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser 
340 345 L 350 

Leu Pro Ser Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val 
355 360 365 

Gin Lys Val Glu Glu Met val Gin Asn His Met Thr Tyr Ser Leu Gin 
370 375 380 

Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
385 390 395 400 

Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly He Val Leu Asp 
405 410 415 



Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
420 425 430 
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Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
435 440 445 

lie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala lie He 
450 455 460 

He Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
465 470 475 480 

Leu Tyr Leu Ser Val He Arg Lys He Pro Ala Leu Thr Glu Asn Asp 
485 490 495 

Pro Glu Thr Trp He Val Cys Asn Phe Ser Val Asp His Asp Ser Ala 
500 505 510 

Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He Asn Val Ala Met He 
515 520 525 

Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu lie Ser Arg 
530 535 540 

Asp Asn He Leu Cys Lys lie Thr Tyr Val Ala Asn Val Asn Pro Gly 
545 550 555 560 

Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
565 570 575 

Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
580 585 590 

Gly Lys Pro He Leu Phe 
595 



<210> 9 

<211> 2684 

<212> DNA 

<213> Mus musculus 

<220> 
<221> CDS 

<222> (444) . . (2237) 
<400> 9 

cgggccacca cgtgtaaata gtatcggacc cggcaggaag atggcggctg tagcggaggt 60 

gtgagtgagt ggatctgggt ctctgccgtt ggcttggctc tccccgtctt cctcccctcc 12 0 

tccctccctg actgaggttg gcatctaggg ggccgagttc aggtggcggc gccgggcgca 180 

gcgcaggggt cacggccacg gcggctgacg gctggaaggg caggctttct tcgccgctcg 24 0 

tcctccttcc ccggtccgct cggtgtcagg cgcggcggcg gcggcgcggc gggcgcgctt 3 00 

cgtccctctt cctgttccct cactccccgg agcgggctct cttggcggtg ccatcccccg 360 

acccttcacc ccagggacta ggcgcctgca ctggcgcagc tcgcggagcg ggggccggtc 42 0 

tcctgctcgg ctgtcgcgtc tec atg teg gat aac cag age tgg aac teg teg 473 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser 

15 10 
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ggc teg gag gag gat ccg gag acg gag tec ggg ccg cct gtg gag cgc 521 
Gly Ser Glu Glu Asp Pro Glu Thr Glu Ser Gly Pro Pro Val Glu Arg 
15 20 25 

tgc ggg gtc etc age aag tgg aca aac tat att cat gga tgg cag gat 569 
Cys Gly Val Leu Ser Lys Trp Thr Asn Tyr lie His Gly Trp Gin Asp 
30 35 40 

cgt tgg gta gtt ttg aaa aat aat act ttg agt tac tac aaa tct gaa 617 
Arg Trp Val Val Leu Lys Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu 
45 50 55 

gat gaa aca gaa tat ggc tgt agg gga tec ate tgt ctt age aag get 665 
Asp Glu Thr Glu Tyr Gly Cys Arg Gly Ser He Cys Leu Ser Lys Ala 
60 65 70 

gtg ate acg cct cac gat ttt gat gaa tgc egg ttt gat ate agt gta 713 
Val He Thr Pro His Asp Phe Asp Glu Cys Arg Phe Asp lie Ser Val 
75 80 85 90 

aat gat agt gtt tgg tac ctt cga get cag gac ccg gag cac aga cag 761 
Asn Asp Ser Val Trp Tyr Leu Arg Ala Gin Asp Pro Glu His Arg Gin 
95 100 105 

caa tgg gta gac gee att gaa cag cac aag act gaa teg gga tat gga 8 09 
Gin Trp Val Asp Ala He Glu Gin His Lys Thr Glu Ser Gly Tyr Gly 
110 115 120 

tct gag tec age ttg cgt aga cat ggc tea atg gtg tea ctg gtg tct 857 
Ser Glu Ser Ser Leu Arg Arg His Gly Ser Met Val Ser Leu Val Ser 
125 130 135 

gga gcg agt ggc tat tct get acg tec acc tct tct ttc aag aaa ggc 905 
Gly Ala Ser Gly Tyr Ser Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly 
140 145 150 

cac agt tta cgt gag aaa ctg get gaa atg gag aca ttt egg gac ate 953 
His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr Phe Arg Asp He 
155 160 165 170 

ctg tgc egg cag gtt gat act etc cag aag tac ttt gat gtc tgt get 1001 
Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe Asp Val Cys Ala 
175 180 185 

gac get gtc tec aag gat gag ctt cag agg gat aaa gtc gta gaa gat 1049 
Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys Val Val Glu Asp 
190 195 200 

gat gaa gat gac ttc cct aca act cgt tct gat gga gac ttt ttg cac 1097 
Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly Asp Phe Leu His 
205 210 215 

aat acc aat ggt aat aaa gaa aaa tta ttt cca cat gta aca cca aaa 1145 
Asn Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His Val Thr Pro Lys 
220 225 230 

gga att aat ggc ata gac ttt aaa ggg gaa gca ata act ttt aaa gca 1193 
Gly He Asn Gly He Asp Phe Lys Gly Glu Ala He Thr Phe Lys Ala 
235 240 245 250 
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act act get gga ate ctt get aca ctt tct cat tgt att gaa tta atg 1241 
Thr Thr Ala Gly lie Leu Ala Thr Leu Ser His Cys lie Glu Leu Met 
255 260 265 

gta aaa egg gaa gag age tgg caa aaa aga cac gat agg gaa gtg gaa 1289 
val Lys Arg Glu Giu Ser Trp Gin Lys Arg His Asp Arg Glu Val Glu 
270 275 280 

aag agg aga cga gtg gag gaa gcg tac aag aat gtg atg gaa gaa ctt 1337 
Lys Arg Arg Arg Val Glu Glu Ala Tyr Lys Asn Val Net Glu Glu Leu 
285 290 295 

aag aag aaa ccc cgt ttc gga ggg ccg gat tat gaa gaa ggt cca aac 1385 
Lys Lys Lys Pro Arg Phe Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn 
300 305 310 

agt ctg att aat gag gaa gag ttc ttt gat get gtt gaa get get ctt 1433 
Ser Leu lie Asn Glu Glu Glu Phe Phe Asp Ala Val Glu Ala Ala Leu 
315 320 325 330 

gac aga caa gat aaa ata gag gaa cag tea cag agt gaa aag gtc agg 14 81 
Asp Arg Gin Asp Lys lie Glu Glu Gin Ser Gin Ser Glu Lys Val Arg 
335 340 345 

tta cac tgg ccc aca tea ttg cca tct gga gac ace ttt tct tct gtc 1529 
Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Thr Phe Ser Ser Val 
350 355 ' 360 

ggg acg cat aga ttt gta caa aag gtt gaa gaa atg gta cag aac cac 1577 
Gly Thr His Arg Phe Val Gin Lys Val Glu Glu Met Val Gin Asn His 
365 370 375 

atg aac tat tea tta cag gat gta ggt ggt gat gca aat tgg caa ctg 162 5 
Met Asn Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin Leu 
380 385 390 

gtt gtt gaa gaa gga gaa atg aag gta tac aga aga gaa gtg gaa gaa 1673 
Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu Glu 
395 400 405 410 

aat gga att gtt ctg gat cct ttg aaa get act cat gca gtt aaa ggt 1721 
Asn Gly lie Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys Gly 
415 420 425 

gtt aca gga cat gag gtc tgc aat tac ttt tgg aat gtt gat gtt cgc 1769 
Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asd Val Arg 
430 435 440 

aat gac tgg gaa act act ata gaa aac ttt cat gtg gtg gaa aca tta 1817 
Asn Asp Trp Glu Thr Thr He Glu Asn Phe His Val Val Glu Thr Leu 
445 450 455 

get gat aat gca ate ate gtt tat caa acg cac aag aga gta tgg ccc 1865 
Ala Asp Asn Ala He He Val Tyr Gin Thr His Lys Arg Val Trp Pro 
460 465 470 

get tct cag aga gac gta ctg tat ctt tct get att cga aag ate cca 1913 
Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Ala He Arg Lys He Pro 
475 480 485 490 

gee ttg act gaa aat gat cct gaa act tgg ata gtt tgt aat ttt tct 1961 
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Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val Cys Asn Phe Ser 
495 500 * 505 

gtg gat cat gat agt get cct ctg aac aat cga tgt gtc cgt gec aaa 2 00 9 
Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala Lys 
510 515 520 

ate aat att get atg att tgt caa act tta gta age cca cca gag gga 2057 
He Asn He Ala Met He Cys Gin Thr Leu Val Ser Pro Pro Glu Gly 
525 530 535 

gac cag gag ata age aga gac aac att ctg tgc aag ate acg tat gta 2105 
Asp Gin Glu He Ser Arg Asp Asn He Leu Cys Lys He Thr Tyr Val 
540 545 550 

get aat gtg aac cca gga gga tgg gcg cca get teg gtc tta aga gca 215 3 
Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg Ala 
555 560 565 570 

gtg gca aag cga gaa tac cct aag ttt eta aaa cgt ttt act tct tat 2201 
Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr 
575 580 585 

gtc caa gaa aaa act gca gga aaa cca att ttg ttt tagtattaac 2247 
Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 





590 


595 








agtgactgaa 


geaaggctge 


gtgacgttcc atgttggaga 


aaggagggaa 


aaaataaaaa 


2307 


gaatcctcta 


agctggaacg 


taggatctac agecttgtet 


gtggcccaag 


aagaaacatt 


2367 


geaategtaa 


agctgggtat 


ccagcactag ccatctcctg 


ctaggcctcc 


tcgctcagcg 


2427 


tgtaactata 


aatacatgta 


gaatcacatg gatatggcta 


tatttttatt 


tgcttgctcc 


2487 


ttggagtgaa 


aacaaataac 


tttgaattac aactaggaat 


taaccgatgc 


tttaattttg 


2547 


aggaactttt 


tcagaatttt 


ttatttacca tggtccaacc 


taagatcctc 


agttgtatca 


2607 


agtttttgtg 


cacaaaagaa 


aagcacaaaa gttgaacgea 


cctgaaggca 


tgtgctctct 


2667 


gtgcaacaaa 


tactcag 








2684 



<210> 10 

<211> 598 

<212> PRT 

<213> Mus musculus 

<400> 10 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
15 10 15 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys 
20 25 30 

Trp Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 ~ 45 

Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 
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Cys Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp 
65 70 75 80 

Phe Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 

Leu Arg Ala Gin Asp Pro Glu His Arg Gin Gin Trp Val Asp Ala He 
100 105 no 

Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
US 120 125 

Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 

Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 ~ 160 

Leu Ala Glu Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp 
165 170 175 

Thr Leu Gin Lys Tyr Phe Asp Val Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 

Glu Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 200 205 

Thr Thr Arg Ser Asp Gly Asp Phe Leu His Asn Thr Asn Gly Asn Lys 
210 215 220 

Glu Lys Leu Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp 
225 230 235 240 

Phe Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu 
245 250 255 

Ala Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Glu Ser 
260 265 270 

Trp Gin Lys Arg His Asp Arg Glu Val Glu Lys Arg Arg Arg Val Glu 
275 280 285 

Glu Ala Tyr Lys Asn Val Met Glu Glu Leu Lys Lys Lys Pro Arg Phe 
290 295 300 

Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu 
305 310 315 320 

Glu Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He 
325 330 335 

Glu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser 
340 345 350 

Leu Pro Ser Gly Asp Thr Phe Ser Ser Val Gly Thr His Arg Phe Val 
355 360 365 



Gin Lys Val Glu Glu Met Val Gin Asn His Met Asn Tyr Ser Leu Gin 
370 375 380 
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Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
385 390 395 400 

Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly He Val Leu Asp 
405 410 ^ 415 

Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
420 425 430 

Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
435 440 445 

He Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala He He 
450 455 460 

val Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
465 470 475 480 

Leu Tyr Leu Ser Ala He Arg Lys He Pro Ala Leu Thr Glu Asn Asp 
485 490 495 

Pro Glu Thr Trp He Val Cys Asn Phe Ser Val Asp His Asp Ser Ala 
500 505 510 

Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He Asn lie Ala Met He 
515 520 525 

Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asp Gin Glu He Ser Arg 
530 535 ' 540 

Asp Asn He Leu Cys Lys He Thr Tyr Val Ala Asn Val Asn Pro Gly 
545 550 555 560 

Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
565 570 * 575 

Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
580 585 590 



Gly Lys Pro He Leu Phe 
595 



<210> 11 
<211> 2283 
<212> DNA 
<213> Bos taurus 

<220> 
<221> CDS 

<222> (421) . . (2214) 
<400> 11 

cggcaggaag atggcggcct agcggaggtg tgagtggacc tgggtctctg cagctgggtt 60 
ttccctcttc ccgtctttct cctcttttcc tctcccccga ggttggcatc gagggggcca 12 0 
aattcgggcg gcggcgccgg gcgcagcgca ggggtcacaa cgacggcgac ggctgacggt 180 
tggaagggca ggcttccttc gcccctcgac ctccttcccc ggtccgcttg gtgtcaggcg 24 0 
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cggcggcggc ggcggcggcg gcgcggcggg cggactccat ccctcctccc gctccctcct 300 

gcaccggagc gggcactcct tccttcgcca tcccccgacc cttcaccccg gggactgggc 3 60 

gcctccaccg gcgcagctca gggagcgggg gccggtctcc tgctcggctg tcgcgcctcc 420 

atg teg gat aac cag age tgg aac teg teg ggc teg gag gag gat ccg 4 68 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 

1 5 10 15 

gag acg gag tec ggg ccg ccg gtg gag cgc tgc gga gtc etc aac aag 516 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Asn Lys 

20 25 30 

tgg aca aac tat att cat ggg tgg cag gat cgc tgg gta gtt ttg aaa 564 

Trp Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 

35 40 " 45 

aat aac act ctg agt tac tac aaa tct gaa gat gag aca gag tat ggc 612 

Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 

50 55 60 

tgc aga gga tec ate tgt ctt age aag get gtc ate acg cct cat gat 660 

Cys Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp 

65 70 75 80 

ttt gat gaa tgc cga ttt gat att agt gta aat gat agt gtt tgg tat 708 

Phe Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr 

85 90 95 

ctt cgt get caa gat cca gat cac aga cag cag tgg ata gat gec att 756 

Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He 

100 105 no 

gaa cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt 804 

Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 

115 120 ~ 125 

cga cat ggc tec atg gta tea ttg gta tec gga gca agt ggc tat tct 852 

Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 

130 135 140 

gca aca tec ace tec tea ttc aag aag ggc cac agt tta cgt gag aaa 900 

Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 

145 150 155 160 

ctg get gaa atg gaa ace ttt aga gat ata ctg tgt aga caa gtt gat 94 8 

Leu Ala Glu Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp 

165 170 175 

ace eta cag aag ttc ttt gat gec tgt get gat get gtc tec aag gat 996 

Thr Leu Gin Lys Phe Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp 

180 IBS 190 

gaa ttt caa agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct 1044 

Glu Phe Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 

195 200 205 

acg aca cgt tct gat gga gac ttc ttg cat aat ace aat ggc aat aag 10 92 

Thr Thr Arg Ser Asp Gly Asp Phe Leu His Asn Thr Asn Gly Asn Lys 

210 215 220 
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gaa aag gta ttt cca cat gta aca cca aaa gga att aat ggt ata gac 1140 
Glu Lys Val Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp 
225 230 235 ' 240 

ttt aaa ggt gag gcg ata act ttt aaa gca act act gcc gga ate ctt 1188 
Phe Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu 
245 250 255 

get aca ctt tct cat tgt att gag ctg atg gta aaa cgt gag gac age 1236 
Ala Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser 
260 265 270 

tgg caa aag aga atg gac aag gaa act gag aag aga aga aga gtg gag 12 B4 
Trp Gin Lys Arg Met Asp Lys Glu Thr Glu Lys Arg Arg Arg Val Glu 
275 280 285 

gaa gca tac aaa aat gcc atg aca gaa ctt aag aaa aaa tec cac ttt 13 32 
Glu Ala Tyr Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe 
290 295 300 

gga gga cca gat tat gag gaa ggc cca aac agt ttg att aat gaa gag 1380 
Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu 
305 310 315 320 

gag ttc ttt gat get gtt gaa get get ctt gac aga caa gat aaa ata 1428 
Glu Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He 
325 330 335 

gaa gaa cag teg cag agt gaa aag gtc agg tta cat tgg tct act tea 1476 
Glu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Ser Thr Ser 
340 345 350 

atg cca tct gga gat gcc ttt tct tct gtg ggg act cat aga ttt gtc 1524 
Met Pro Ser Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val 
355 360 365 

caa aag gtt gaa gag atg gtg cag aac cac atg acc tat tea ttg cag 1572 
Gin Lys Val Glu Glu Met Val Gin Asn His Met Thr Tyr Ser Leu Gin 
370 375 380 

gat gta ggt ggg gac gcc aac tgg cag ttg gtt gta gaa gaa ggg gag 1620 
Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
385 390 395 400 

atg aag gta tat aga aga gaa gta gaa gaa aat ggg att gtt ctg gat 1668 
Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly He Val Leu Asp 
405 410 415 

cct ttg aaa get acc cat gca gtt aaa ggc gtt aca gga cac gag gtc 1716 
Pro Leu Lys Ala Thr His Ala val Lys Gly Val Thr Gly His Glu Val 
420 425 430 

tgc aat tac ttc tgg aat gtt gat gtt cgc aat gat tgg gaa aca act 1764 
Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
435 440 445 

ata gaa aac ttt cat gtg gtg gaa aca tta get gat aat gca ate ate 1812 
He Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala He He 
450 455 460 
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att tat caa acg cac aag aga gtg tgg cca gcc tct cag egg gat gtc 1860 
He Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
465 470 475 480 

tta tat ctg tct gcc att cga aag ata cca get ttg aat gaa aat gac 1908 
Leu Tyr Leu Ser Ala He Arg Lys He Pro Ala Leu Asn Glu Asn Asp 
485 490 495 

ccg gag act tgg ata gtt tgt aat ttt tct gta gat cac age agt get 1956 
Pro Glu Thr Trp He Val Cys Asn Phe Ser Val Asp His Ser Ser Ala 
500 505 510 

cct eta aac aat cga tgt gtc cgt gcc aaa ata aac gtt get atg att 2 004 
Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He Asn Val Ala Met He 
515 520 525 

tgt cag ace ttg gtg age ccc cca gag gga aac cag gag att age agg 2 052 
Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu lie Ser Arg 
530 535 540 

gac aac att eta tgc aag att aca tac gtg gcc aat gta aac cct gga 2100 
Asp Asn He Leu Cys Lys He Thr Tyr Val Ala Asn Val Asn Pro Gly 
545 550 555 560 

gga tgg gcc cca gcc tea gtg tta egg gca gtg gca aag cga gaa tat 214 8 
Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
565 570 575 

cca aag ttt eta aag cgt ttt act tct tac gta caa gaa aaa act gca 2196 
Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
580 585 590 

gga aaa cct att ttg ttc tagtattaac agtgactgaa gcaaggctgt 2244 
Gly Lys Pro He Leu Phe 
595 

gtgacattcc atgttggagg aaaaaaaaaa aaaaaaaaa 2283 



<210> 12 
<211> 598 
<212> PRT 
<213> Bos taurus 

<400> 12 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
1 5 10 15 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Asn Lys 
20 25 30 

Trp Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 45 

Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 

Cys Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp 
65 70 75 80 

Phe Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr 
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85 



90 



95 



Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He 
100 105 no 

Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 125 

Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 

Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 ~ 160 

Leu Ala Glu Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp 
165 170 " 175 

Thr Leu Gin Lys Phe Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 

Glu Phe Gin Arg Asp Lys Val Val Glu Asp Asp Olu Asp Asp Phe Pro 
195 200 205 

Thr Thr Arg Ser Asp Gly Asp Phe Leu His Asn Thr Asn Gly Asn Lys 
210 215 220 

Glu Lys Val Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp 
225 230 235 240 

Phe Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu 
245 250 255 

Ala Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser 
260 265 270 

Trp Gin Lys Arg Met Asp Lys Glu Thr Glu Lys Arg Arg Arg Val Glu 
275 280 285 

Glu Ala Tyr Lys Asn Ala Met Thr Glu Leu Lys Lvs Lys Ser His Phe 
290 295 300 

Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu 
305 310 315 320 

Glu Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He 
325 330 1 " 335 

Glu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Ser Thr Ser 
340 345 350 

Met Pro Ser Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val 
355 360 365 

Gin Lys Val Glu Glu Met Val Gin Asn His Met Thr Tyr Ser Leu Gin 
370 375 380 



Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 

385 390 395 400 

Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly He Val Leu Asp 

405 410 415 
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Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
420 425 430 

Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
435 440 445 

lie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala He He 
450 455 460 

He Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
465 470 475 ** 480 

Leu Tyr Leu Ser Ala He Arg Lys He Pro Ala Leu Asn Glu Asn Asp 
485 490 495 

Pro Glu Thr Trp He Val Cys Asn Phe Ser Val Asp His Ser Ser Ala 
500 505 510 

Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He Asn Val Ala Met He 
515 520 525 

Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu He Ser Arg 
530 535 540 

Asp Asn He Leu Cys Lys He Thr Tyr Val Ala Asn Val Asn Pro Gly 
545 550 555 560 

Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
565 570 575 

Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
580 585 590 

Gly Lys Pro He Leu Phe 
595 



<210> 13 

<211> 78 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> (1) . . (78) 

<400> 13 

ccc tat age cgc tct tec tec atg tct tec att gat eta gtc agt gec 4 8 
Pro Tyr Ser Arg Ser Ser Ser Met Ser Ser He Asp Leu Val Ser Ala 
15 10 is 

tct gat gat gtt cac aga ttc age tec cag 7a 
Ser Asp Asp Val His Arg Phe Ser Ser Gin 
20 25 



<210> 14 
<211> 26 
<212> PRT 

<213> Homo sapiens 
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<400> 14 

Pro Tyr Ser Arg Ser Ser Ser Met 
1 5 

Ser Asp Asp Val His Arg Phe Ser 

20 



Ser Ser He Asp Leu Val Ser Ala 
10 15 

Ser Gin 
25 



<210> 15 
<211> 2034 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPBPR3 

<220> 

<221> CDS 

<222> (10) . . (990) 

<400> 15 

gaattcacc atg gcc cca eta gec gac tac aag gac gac gat gac aag atg 51 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met 

1 5 io 

teg gat aat cag age tgg aac teg teg ggc teg gag gag gat cca gag 99 
Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu 
15 20 25 30 

acg gag tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg 147 
Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp 
35 40 45 

aca aac tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat 195 
Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn 
50 55 60 

aat get ctg agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc 243 
Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys 
65 70 75 

aga gga tec ate tgt ctt age aag get gtc ate aca cct cac gat ttt 291 
Arg Gly Ser lie Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe 
80 85 90 

gat gaa tgt cga ttt gat att agt gta aat gat agt gtt tgg tat ctt 339 
Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu 
95 100 105 HO 

cgt get cag gat cca gat cat aga cag caa tgg ata gat gcc att gaa 387 
Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala He Glu 
115 120 125 

cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga 435 
Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg 
130 135 140 

cat ggc tea atg gtg tec ctg gtg tct gga gca agt ggc tac tct gca 4 83 
His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala 
145 150 155 
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aca tec acc tct tea ttc aag aaa ggc cac agt tta cgt gag aag ttg 531 
Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu 
160 165 170 

get gaa atg gaa aca ttt aga gac ate tta tgt aga caa gtt gac acg 57 9 
Ala Glu Met Glu Thr Phe Arg Asp lie Leu Cys Arg Gin Val Asp Thr 
17S 180 185 190 

eta cag aag tac ttt gat gec tgt get gat get gtc tct aag gat gaa €27 
Leu Gin Lys Tyr Phe Asp Ala Cys Ala Asp Ala val Ser Lys Asp Glu 
195 200 205 

ctt caa agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct aca 67 5 
Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr 
210 215 220 

acg cgt tct gat ggt gac ttc ttg cat agt acc aac ggc aat aaa gaa 723 
Thr Arg Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu 
225 230 235 

aag tta ttt cca cat gtg aca cca aaa gga att aat ggt ata gac ttt 771 
Lys Leu Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp Phe 
240 245 250 

aaa ggg gaa gcg ata act ttt aaa gca act act get gga ate ctt gca 819 
Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala 
255 * 260 265 270 

aca ctt tct cat tgt att gaa eta atg gtt aaa cgt gag gac age tgg 867 
Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser Trp 
275 280 " 285 

cag aag aga ctg gat aag gaa act gag aag aaa aga aga aca gag gaa 915 
Gin Lys Arg Leu Asp Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu Glu 
290 ' 295 300 

gca tat aaa aat gca atg aca gaa cga aaa aat ccc act ttg gag gac 963 
Ala Tyr Lys Asn Ala Met Thr Glu Arg Lys Asn Pro Thr Leu Glu Asp 
305 310 315 

cag att atg aag aag gec eta aca gtc tgattaatga agaagagttc 10 10 

Gin He Met Lys Lys Ala Leu Thr Val 



320 




325 










tttgatgctg 


ttgaagctgc 


tcttgacaga 


caagataaaa 


tagaagaaca 


gtcacagagt 


1070 


gaaaaggtga 


gattacattg 


gcctacatcc 


ttgccctctg 


gagatgeett 


ttcttctgtg 


1130 


gggacacata 


gatttgtcca 


aaagecctat 


agtegctett 


cctccatgtc 


ttccattgat 


1190 


ctagtcagtg 


cctctgatga 


tgttcacaga 


ttcagctccc 


aggttgaaga 


gatggtgcag 


1250 


aaccacatga 


cttactcatt 


acaggatgta 


ggcggagatg 


ccaattggca 


gttggttgta 


1310 


gaagaaggag 


aaatgaaggt 


atacagaaga 


gaagtagaag 


aaaatgggat 


tgttctggat 


1370 


cctttaaaag 


ctacccatgc 


agttaaaggc 


gtcacaggac 


atgaagtctg 


caattatttc 


1430 


tggaatgttg 


acgttcgcaa 


tgactgggaa 


acaactatag 


aaaactttca 


tgtggtggaa 


1490 
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acattagctg ataatgcaat catcatttat caaacacaca agagggtgtg gcctgcttct 1550 

cagcgagacg tattatatct ttctgtcatt cgaaagatac cagccttgac tgaaaatgac 1610 

cctgaaactt ggatagtttg taatttttct gtggatcatg acagtgctcc tctaaacaac 1670 

cgatgtgtcc gtgccaaaat aaatgttgct atgatttgtc aaaccttggt aagcccacca 1730 

gagggaaacc aggaaattag cagggacaac attctatgca agattacata tgtagctaat 17 90 

gtgaaccctg gaggatgggc accagcctca gtgttaaggg cagtggcaaa gcgagagtat 1850 
cctaaatttc taaaacgttt tacttcttac gtccaagaaa aaactgcagg aaagcctatt 1910 

ttgttctagt attaacaggt actagaagat atgttttatc tttttttaac tttatttgac 1970 

taatatgact gtcaatacta aaatttagtt gttgaaagta tttactatgt tttttccgga 2 030 
atcc 2034 



<210> 16 
<211> 327 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: GPBPR3 
<400> 16 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met Ser Asp 
15 10 15 

Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu 
20 25 30 

Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
35 40 45 

Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
50 55 60 

Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
65 70 75 80 

Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe Asp Glu 
85 90 95 

Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
100 105 110 

Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu Gin His 
115 120 125 

Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
130 135 140 

Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala Thr Ser 
145 150 155 160 

Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu 
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165 

Met Glu Thr Phe Arg Asp lie 
180 

Lys Tyr Phe Asp Ala Cys Ala 
195 

Arg Asp Lys Val Val Glu Asp 
210 215 

Ser Asp Gly Asp Phe Leu His 
225 230 

Phe Pro His Val Thr Pro Lys 
245 

Glu Ala He Thr Phe Lys Ala 
260 

Ser His Cys lie Glu Leu Met 
275 

Arg Leu Asp Lys Glu Thr Glu 
290 295 

Lys Asn Ala Met Thr Glu Arg 
305 310 

Met Lys Lys Ala Leu Thr Val 
325 



<210> 17 
<211> 1978 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG - GPB PDNLS 

<220> 

<221> CDS 

<222> (10) . . (1860) 

<400> 17 

gaattcacc atg gcc cca eta gec gac tac aag gac gac gat gac aag atg 51 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met 
1 5 io 

teg gat aat cag age tgg aac teg teg ggc teg gag gag gat cca gag 99 
Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu 
15 20 25 * 30 

acg gag tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg 147 
Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp 
35 40 4 5 

aca aac tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat 195 
Thr Asn Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn 
50 55 60 



170 

Leu Cys Arg 
185 

Asp Ala Val 
200 

Asp Glu Asp 

Ser Thr Asn 

Gly lie Asn 
250 

Thr Thr Ala 
265 

Val Lys Arg 
280 

Lys Lys Arg 
Lys Asn Pro 



Gin Val Asp 



Ser Lys Asp 
205 

Asp Phe Pro 

220 

Gly Asn Lys 
235 

Gly lie Asp 



Gly lie Leu 



Glu Asp Ser 
285 

Arg Thr Glu 
300 

Thr Leu Glu 
315 



175 

Thr Leu Gin 
190 

Glu Leu Gin 

Thr Thr Arg 

Glu Lys Leu 
240 

Phe Lys Gly 
255 

Ala Thr Leu 
270 

Trp Gin Lys 

Glu Ala Tyr 

Asp Gin lie 
320 
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aat get ctg agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc 243 

Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cvs 

65 70 75 

aga gga tec ate tgt ctt age aag get gtc ate aca cct cac gat ttt 291 

Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe 

80 85 90 

gat gaa tgt cga ttt gat att agt gta aat gat agt gtt tgg tat ctt 339 

Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu 

95 100 105 no 

cgt get cag gat cca gat cat aga cag caa tgg ata gat gee att gaa 387 

Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu 

115 120 " 125 

cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga 435 

Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg 

130 135 140 

cat ggc tea atg gtg tec ctg gtg tct gga gca agt ggc tac tct gca 4 83 

His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala 

145 150 155 

aca tec ace tct tea ttc aag aaa ggc cac agt tta cgt gag aag ttg 531 

Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu 

160 165 170 

get gaa atg gaa aca ttt aga gac ate tta tgt aga caa gtt gac acg 579 

Ala Glu Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp Thr 

175 180 185 190 

eta cag aag tac ttt gat gee tgt get gat get gtc tct aag gat gaa 627 

Leu Gin Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu 

195 200 * 205 

ctt caa agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct aca 675 

Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr 

210 215 " 220 

acg cgt tct gat ggt gac ttc ttg cat agt ace aac ggc aat aaa gaa 723 

Thr Arg Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu 

225 230 235 

aag tta ttt cca cat gtg aca cca aaa gga att aat ggt ata gac ttt 771 

Lys Leu Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp Phe 

240 245 250 

aaa ggg gaa gcg ata act ttt aaa gca act act get gga ate ctt gca 819 

Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala 

255 260 265 * 270 

aca ctt tct cat tgt att gaa eta atg gtt aaa cgt gag gac age tgg 867 

Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser Trp 

275 280 ~ 285 

cag aag aga ctg gat aag gaa act gag cac ttt gga gga cca gat tat 915 

Gin Lys Arg Leu Asp Lys Glu Thr Glu His Phe Gly Gly Pro Asp Tyr 

290 295 300 
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gaa gaa ggc cct aac agt ctg att aat gaa gaa gag ttc ttt gat get 963 

Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu Glu Phe Phe Asp Ala 

305 310 315 

gtt gaa get get ctt gac aga caa gat aaa ata gaa gaa cag tea cag 1011 
Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He Glu Glu Gin Ser Gin 
320 325 330 

agt gaa aag gtg aga tta cat tgg cct aca tec ttg ccc tct gga gat 1059 

Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp 
335 340 345 350 

gee ttt tct tct gtg ggg aca cat aga ttt gtc caa aag ccc tat agt 1107 

Ala Phe Ser Ser Val Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser 
355 360 365 

cgc tct tec tec atg tct tec att gat eta gtc agt gee tct gat gat 1155 

Arg Ser Ser Ser Met Ser Ser He Asp Leu Val Ser Ala Ser Asp Asp 
370 375 380 

gtt cac aga ttc age tec cag gtt gaa gag atg gtg cag aac cac atg 1203 

Val His Arg Phe Ser Ser Gin Val Glu Glu Met Val Gin Asn His Met 

385 390 395 

act tac tea tta cag gat gta ggc gga gat gee aat tgg cag ttg gtt 1251 

Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val 
400 405 410 

gta gaa gaa gga gaa atg aag gta tac aga aga gaa gta gaa gaa aat 12 99 

Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn 
415 420 425 430 

ggg att gtt ctg gat cct tta aaa get acc cat gca gtt aaa ggc gtc 1347 

Gly He Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys Gly Val 
435 440 445 

aca gga cat gaa gtc tgc aat tat ttc tgg aat gtt gac gtt cgc aat 1395 

Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn 
450 455 460 

gac tgg gaa aca act ata gaa aac ttt cat gtg gtg gaa aca tta get 1443 

Asp Trp Glu Thr Thr lie Glu Asn Phe His Val Val Glu Thr Leu Ala 

465 470 475 

gat aat gca ate ate att tat caa aca cac aag agg gtg tgg cct get 1491 

Asp Asn Ala He He He Tyr Gin Thr His Lys Arg Val Trp Pro Ala 
480 485 490 

tct cag cga gac gta tta tat ctt tct gtc att cga aag ata cca gec 153 9 

Ser Gin Arg Asp Val Leu Tyr Leu Ser Val He Arg Lys He Pro Ala 
495 500 505 " 510 

ttg act gaa aat gac cct gaa act tgg ata gtt tgt aat ttt tct gtg 1587 

Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val Cys Asn Phe Ser Val 
515 520 525 

gat cat gac agt get cct eta aac aac cga tgt gtc cgt gee aaa ata 1635 

Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He 
530 535 540 

aat gtt get atg att tgt caa acc ttg gta age cca cca gag gga aac 16B3 
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Asn Val Ala Met lie Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn 
545 550 555 

cag gaa att age agg gac aac att eta tgc aag att aca tat gta get 17 31 
Gin Glu lie Ser Arg Asp Asn He Leu Cys Lys He Thr Tyr Val Ala 
560 565 570 

aat gtg aac cct gga gga tgg gca cca gee tea gtg tta agg gca gtg 1779 
Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val 
575 580 585 ~ 590 

gca aag cga gag tat cct aaa ttt eta aaa cgt ttt act tct tac gtc 1827 
Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val 
595 600 605 

caa gaa aaa act gca gga aag cct att ttg ttc tagtattaac aggtactaga 1880 
Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
610 615 

agatatgttt tatctttttt taactttatt tgactaatat gactgtcaat actaaaattt 1940 

agttgttgaa agtatttact atgttttttc eggaatte 1978 



<210> 18 
<211> 617 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG - GPB PDNLS 
<400> 18 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met Ser Asp 
15 10 15 

Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu 
20 25 * 30 

Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
35 40 45 

Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
50 .55 60 

Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
65 70 75 * 80 

Ser lie Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe Asp Glu 
85 90 95 

Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
100 105 * 110 

Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu Gin His 
115 120 125 

Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
130 135 140 

Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala Thr Ser 
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145 150 155 



160 



Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu 
1^5 170 175 

Met Glu Thr Phe Arg Asp lie Leu Cys Arg Gin Val Asp Thr Leu Gin 
180 185 190 

Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin 
195 200 205 

Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg 
210 215 220 

Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu 
225 230 235 2 40 

Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp Phe Lys Gly 
245 250 " 255 

Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala Thr Leu 
260 265 270 

Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys 
275 280 285 

Arg Leu Asp Lys Glu Thr Glu His Phe Gly Gly Pro Asp Tyr Glu Glu 
290 295 300 

Gly Pro Asn Ser Leu He Asn Glu Glu Glu Phe Phe Asp Ala Val Glu 
305 310 315 * 320 

Ala Ala Leu Asp Arg Gin Asp Lys He Glu Glu Gin Ser Gin Ser Glu 
325 330 335 

Lys Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala Phe 
340 345 350 

Ser Ser Val Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser Arg Ser 
355 360 365 

Ser Ser Met Ser Ser He Asp Leu Val Ser Ala Ser Asp Asp Val His 
370 375 380 

Arg Phe Ser Ser Gin Val Glu Glu Met Val Gin Asn His Met Thr Tyr 
385 390 395 400 

Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu 
405 410 415 

Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly He 
420 425 430 

Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly 
435 440 445 

His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp 
450 455 460 

Glu Thr Thr He Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn 
465 470 475 480 
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Ala lie lie lie Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin 
435 490 495 

Arg Asp Val Leu Tyr Leu Ser Val lie Arg Lys He Pro Ala Leu Thr 
500 505 510 

Glu Asn Asp Pro Glu Thr Trp lie Val Cys Asn Phe Ser Val Asp His 
515 520 525 

Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He Asn Val 
530 535 540 

Ala Met He Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu 
545 550 555 560 

He Ser Arg Asp Asn He Leu Cys Lys He Thr Tyr Val Ala Asn Val 
565 570 575 

Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys 
580 585 590 

Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu 
595 600 605 



Lys Thr Ala Gly Lys Pro He Leu Phe 
610 615 



<210> 19 
<211> 1975 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG-GPBPDSXY 

<220> 

<221> CDS 

<222> (10) . . (1857) 

<400> 19 

gaattcacc atg gcc cca eta gec gac tac aag gac gac gat gac aag atg 51 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met 

1 5 10 

teg gat aat cag age tgg aac teg teg ggc teg gag gag gat cca gag 99 
Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu 
15 20 25 30 

acg gag tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg 147 
Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp 
35 40 45 

aca aac tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat 195 
Thr Asn Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn 
50 55 60 

aat get ctg agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc 243 
Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys 
65 70 75 
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aga gga tec ate tgt ctt age aag get gtc ate aca cct cac gat ttt 2 91 
Arg Gly Ser He Cys Leu Ser Lys Ala Val lie Thr Pro His Asp Phe 
80 85 90 

gat gaa tgt cga ttt gat att agt gta aat gat agt gtt tgg tat ctt 339 
Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu 
95 100 105 no 

cgt get cag gat cca gat cat aga cag caa tgg ata gat gee att gaa 3 87 
Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu 
US 120 ' 125 

cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga 435 
Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg 
130 135 140 

cat ggc aaa ggc cac agt tta cgt gag aag ttg get gaa atg gaa aca 4 83 
His Gly Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr 
145 150 155 

ttt aga gac ate tta tgt aga caa gtt gac acg eta cag aag tac ttt 531 
Phe Arg Asp He Leu Cys Arg Gin Val Asp Thr Leu Gin Lvs Tyr Phe 
160 165 170 

gat gee tgt get gat get gtc tct aag gat gaa ctt caa agg gat aaa 579 
Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys 
175 180 185 190 

gtg gta gaa gat gat gaa gat gac ttt cct aca acg cgt tct gat ggt 627 
Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly 
195 200 ~ 205 

gac ttc ttg cat agt ace aac ggc aat aaa gaa aag tta ttt cca cat 675 
Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His 
210 215 220 

gtg aca cca aaa gga att aat ggt ata gac ttt aaa ggg gaa gcg ata 723 
Val Thr Pro Lys Gly He Asn Gly He Asp Phe Lys Gly Glu Ala He 
225 230 235 

act ttt aaa gca act act get gga ate ctt gca aca ctt tct cat tgt 771 
Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala Thr Leu Ser His Cys 
240 245 250 

att gaa eta atg gtt aaa cgt gag gac age tgg cag aag aga ctg gat 819 
He Glu Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys Arg Leu Asp 
255 260 265 270 

aag gaa act gag aag aaa aga aga aca gag gaa gca tat aaa aat gca 867 
Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu Glu Ala Tyr Lys Asn Ala 
275 280 285 

atg aca gaa ctt aag aaa aaa tec cac ttt gga gga cca gat tat gaa 915 
Met Thr Glu Leu Lys Lys Lys Ser His Phe Gly Gly Pro Asp Tyr Glu 
290 295 300 

gaa ggc cct aac agt ctg att aat gaa gaa gag ttc ttt gat get gtt 963 
Glu Gly Pro Asn Ser Leu He Asn Glu Glu Glu Phe Phe Asp Ala Val 
305 310 315 



41 



WO 00/50607 



PCT/IB00/00324 



gaa get get ctt gac aga caa gat aaa ata gaa gaa cag tea cag agt 1011 
Glu Ala Ala Leu Asp Arg Gin Asp Lys He Glu Glu Gin Ser Gin Ser 
320 325 330 

gaa aag gtg aga tta cat tgg cct aca tec ttg ccc tct gga gat gee 1059 
Glu Lys val Arg Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala 
335 340 345 350 

ttt tct tct gtg ggg aca cat aga ttt gtc caa aag ccc tat agt cgc 1107 
Phe Ser Ser Val Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser Arg 
355 360 365 

tct tec tec atg tct tec att gat eta gtc agt gec tct gat gat gtt 1155 
Ser Ser Ser Met Ser Ser He Asp Leu Val Ser Ala Ser Asp Asp Val 
370 375 380 

cac aga ttc age tec cag gtt gaa gag atg gtg cag aac cac atg act 1203 
His Arg Phe Ser Ser Gin Val Glu Glu Met Val Gin Asn His Met Thr 
385 390 395 

tac tea tta cag gat gta ggc gga gat gec aat tgg cag ttg gtt gta 1251 
Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val 
400 405 410 

gaa gaa gga gaa atg aag gta tac aga aga gaa gta gaa gaa aat ggg 1299 
Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly 
415 420 425 430 

att gtt ctg gat cct tta aaa get ace cat gca gtt aaa ggc gtc aca 1347 
He Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr 
435 440 445 

gga cat gaa gtc tgc aat tat ttc tgg aat gtt gac gtt cgc aat gac 13 95 
Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp 
450 455 460 

tgg gaa aca act ata gaa aac ttt cat gtg gtg gaa aca tta get gat 1443 
Trp Glu Thr Thr He Glu Asn Phe His Val Val Glu Thr Leu Ala Asp 
465 470 475 

aat gca ate ate att tat caa aca cac aag agg gtg tgg cct get tct 1491 
Asn Ala He He He Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser 
480 485 490 

cag cga gac gta tta tat ctt tct gtc att cga aag ata cca gec ttg 1539 
Gin Arg Asp Val Leu Tyr Leu Ser Val He Arg Lys He Pro Ala Leu 
495 500 505 510 

act gaa aat gac cct gaa act tgg ata gtt tgt aat ttt tct gtg gat 1587 
Thr Glu Asn Asp Pro Glu Thr Trp He Val Cys Asn Phe Ser Val Asp 
515 520 525 

cat gac agt get cct eta aac aac cga tgt gtc cgt gec aaa ata aat 1635 
His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He A3n 
530 535 540 

gtt get atg att tgt caa acc ttg gta age cca cca gag gga aac cag 1683 
Val Ala Met He Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin 
545 550 555 

gaa att age agg gac aac att eta tgc aag att aca tat gta get aat 1731 
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Glu lie Ser Arg Asp Asn lie Leu Cys Lys lie Thr Tyr val Ala Asn 
560 565 570 

gtg aac cct gga gga tgg gca cca gcc tea gtg tta agg gca gtg gca 1779 
Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala 
575 580 585 590 

aag cga gag tat cct aaa ttt eta aaa cgt ttt act tct tac gtc caa 1327 
Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin 
595 600 605 

gaa aaa act gca gga aag cct act ttg ttc tagtattaac aggtactaga 1877 
Glu Lys Thr Ala Gly Lys Pro lie Leu Phe 
610 615 

agatatgttt tatctttttt taactttatt tgactaatat gactgtcaat actaaaattt 1937 

agttgttgaa agtatttact atgttttttc eggaatte 1975 



<210> 20 

<211> 616 

<212> PRT 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: FLAG -GPBPDSXY 
<400> 20 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met Ser Asp 
1 5 10 * 15 



Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu 
20 25 30 



Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
35 40 45 



Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
50 55 60 

Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
65 70 75 80 

Ser He Cys Leu Ser Lys Ala Val lie Thr Pro His Asp Phe Asp Glu 
85 90 95 

Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
100 105 110 

Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu Gin His 
115 120 125 

Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
130 135 140 

Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr Phe Arg 
145 150 155 160 

Asp He Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe Asp Ala 
165 170 175 
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Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys Val Val 
180 185 X90 

Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly Asp Phe 
195 200 205 

Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His Val Thr 
210 215 2 20 ^ 

Pro Lys Gly lie Asn Gly He Asp Phe Lys Gly Glu Ala He Thr Phe 
225 230 235 240 

Lys Ala Thr Thr Ala Gly lie Leu Ala Thr Leu Ser His Cys lie Glu 
245 250 255 

Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys Arg Leu Asp Lys Glu 
260 265 ~ 270 

Thr Glu Lys Lys Arg Arg Thr Glu Glu Ala Tyr Lys Asn Ala Met Thr 
275 280 285 

Glu Leu Lys Lys Lys Ser His Phe Gly Gly Pro Asp Tyr Glu Glu Gly 
290 295 3 00 Y 



Pro Asn Ser Leu He Asn Glu Glu Glu Phe Phe Asp Ala Val Glu Al 
305 310 315 



a 

320 

Ala Leu Asp Arg Gin Asp Lys He Glu Glu Gin Ser Gin Ser Glu Lys 
325 330 335 

Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala Phe Ser 
34 <> 345 350 

Ser Val Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser Arg Ser Ser 



355 



360 



365 



Ser Met Ser Ser He Asp Leu Val Ser Ala Ser Asp Asp Val His Arq 
370 375 3 8 o 

Phe ser Ser Gin val Glu Glu Met Val Gin Asn His Met Thr Tyr Ser 
385 395 400 

Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu 
405 4io 415 

Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly He Val 
420 425 430 



Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His 
435 440 * 445 

Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu 

450 455 460 

Thr Thr He Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala 



465 



470 



475 



480 



He He He Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg 
485 4 9 o 495 
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Asp Val Leu Tyr Leu Ser Val lie Arg Lys He Pro Ala Leu Thr Glu 
500 505 510 

Asn Asp Pro Glu Thr Trp He Val Cys Asn Phe Ser Val Asp His Asp 
515 520 525 

Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He Asn Val Ala 
530 535 540 

Met He Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu lie 
545 550 555 560 

Ser Arg Asp Asn He Leu Cys Lys He Thr Tyr Val Ala Asn Val Asn 
565 570 575 

Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg 
580 585 590 

Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys 
595 600 605 



Thr Ala Gly Lys Pro He Leu Phe 
610 615 



<210> 21 
<211> 1915 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
FLAG - GPB PDSXY /NLS 

<220> 

<221> CDS 

<222> (10) . . (1797) 

<400> 21 

gaattcacc atg gcc cca eta gec gac tac aag gac gac gat gac aag atg 51 
Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met 
15 10 

teg gat aat cag age tgg aac teg teg ggc teg gag gag gat cca gag 99 
Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu 
15 20 25 * 30 

acg gag tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg 147 
Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp 
35 40 45 

aca aac tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat 195 
Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn 
50 55 60 

aat get ctg agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc 243 
Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys 
65 70 " 75 

aga gga tec ate tgt ctt age aag get gtc ate aca cct cac gat ttt 291 
Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe 
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80 85 90 

gat gaa tgt cga ttt gat att agt gta aat gat agt gtt tgg tat ctt 339 
Asp Glu Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr Leu 
95 100 105 " 110 

cgt get cag gat cca gat cat aga cag caa tgg ata gat gec att gaa 3 87 
Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala lie Glu 
115 120 125 

cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga 435 
Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg 
130 135 140 

cat ggc aaa ggc cac agt tta cgt gag aag ttg get gaa atg gaa aca 4 83 
His Gly Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr 
145 150 155 

ttt aga gac ate tta tgt aga caa gtt gac acg eta cag aag tac ttt 531 
Phe Arg Asp He Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe 
160 165 170 

gat gee tgt get gat get gtc tct aag gat gaa ctt caa agg gat aaa 579 
Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys 
175 180 185 190 

gtg gta gaa gat gat gaa gat gac ttt cct aca acg cgt tct gat ggt 627 
Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly 
195 200 205 

gac ttc ttg cat agt acc aac ggc aat aaa gaa aag tta ttt cca cat 675 
Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His 
210 215 220 

gtg aca cca aaa gga att aat ggt ata gac ttt aaa ggg gaa gcg ata 72 3 
Val Thr Pro Lys Gly He Asn Gly He Asp Phe Lys Gly Glu Ala He 
225 230 235 

act ttt aaa gca act act get gga ate ctt gca aca ctt tct cat tgt 771 
Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala Thr Leu Ser His Cys 
240 245 250 

att gaa eta atg gtt aaa cgt gag gac age tgg cag aag aga ctg gat 819 
He Glu Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys Arg Leu Asp 
255 260 265 " ~ 270 

aag gaa act gag cac ttt gga gga cca gat tat gaa gaa ggc cct aac 867 
Lys Glu Thr Glu His Phe Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn 
275 280 285 

agt ctg att aat gaa gaa gag ttc ttt gat get gtt gaa get get ctt 915 
Ser Leu He Asn Glu Glu Glu Phe Phe Asp Ala Val Glu Ala Ala Leu 
290 295 300 

gac aga caa gat aaa ata gaa gaa cag tea cag agt gaa aag gtg aga 963 
Asp Arg Gin Asp Lys He Glu Glu Gin Ser Gin Ser Glu Lys Val Arg 
305 310 315 

tta cat tgg cct aca tec ttg ccc tct gga gat gee ttt tct tct gtg 1011 
Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala Phe Ser Ser Val 
320 325 330 
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ggg aca cat aga ttt gtc caa aag ccc tat agt cgc tct tec tec atg 1059 
Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser Arg Ser Ser Ser Met 
335 340 345 350 

tct tec att gat eta gtc agt gec tct gat gat gtt cac aga ttc age 1107 
Ser Ser lie Asp Leu Val Ser Ala Ser Asp Asp Val His Arg Phe Ser 
355 360 365 

tec cag gtt gaa gag atg gtg cag aac cac atg act tac tea tta cag 1155 
Ser Gin Val Glu Glu Met Val Gin Asn His Met Thr Tyr Ser Leu Gin 
370 375 380 

gat gta ggc gga gat gec aat tgg cag ttg gtt gta gaa gaa gga gaa 1203 
Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
385 390 395 

atg aag gta tac aga aga gaa gta gaa gaa aat ggg att gtt ctg gat 1251 
Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly lie Val Leu Asp 
400 405 410 

cct tta aaa get acc cat gca gtt aaa ggc gtc aca gga cat gaa gtc 1299 
Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
415 420 425 430 

tgc aat tat ttc tgg aat gtt gac gtt cgc aat gac tgg gaa aca act 1347 
Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
435 440 445 

ata gaa aac ttt cat gtg gtg gaa aca tta get gat aat gca ate ate 1395 
lie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala He He 
450 455 460 

att tat caa aca cac aag agg gtg tgg cct get tct cag cga gac gta 1443 
He Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
465 470 475 

tta tat ctt tct gtc att cga aag ata cca gec ttg act gaa aat gac 1491 
Leu Tyr Leu Ser Val He Arg Lys He Pro Ala Leu Thr Glu Asn Asp 
480 485 490 

cct gaa act tgg ata gtt tgt aat ttt tct gtg gat cat gac agt get 1539 
Pro Glu Thr Trp He Val Cys Asn Phe ser Val Asp His Asp Ser Ala 
495 500 505 510 

cct eta aac aac cga tgt gtc cgt gec aaa ata aat gtt get atg att 1587 
Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He Asn Val Ala Met He 
515 520 525 

tgt caa acc ttg gta age cca cca gag gga aac cag gaa att age agg 1635 
Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu He Ser Arg 
530 535 540 

gac aac att eta tgc aag att aca tat gta get aat gtg aac cct gga 1683 
Asp Asn He Leu Cys Lys He Thr Tyr Val Ala Asn Val Asn Pro Gly 
545 550 555 

gga tgg gca cca gee tea gtg tta agg gca gtg gca aag cga gag tat 1731 
Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
560 565 570 
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cct aaa ttt eta aaa cgt ttt act tec tac gtc caa gaa aaa act gca 1779 
Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr val Gin Glu Lys Thr Ala 
575 580 585 590 

gga aag cct att ttg ttc tagtattaac aggtactaga agatatgttt 1827 
Gly Lys Pro lie Leu Phe 
595 

tatctttttt taactttatt tgactaatat gactgtcaat actaaaattt agttgttgaa 1887 
agtatttact atgttttttc eggaatte 1915 



<210> 22 
<211> 596 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
FLAG -GPBPDSXY / NLS 

<400> 22 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met Ser Asp 
1 5 10 15 

Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu 
20 25 30 

Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
35 40 45 

Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
50 55 60 

Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
65 70 75 80 

Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe Asp Glu 
85 90 95 

Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
100 105 110 

Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu Gin His 
115 120 125 

Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
130 135 140 

Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr Phe Arg 
145 150 155 160 

Asp He Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe Asp Ala 
165 170 175 

Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys Val Val 
180 185 190 

Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly Asp Phe 
195 200 205 
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Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His Val Thr 
210 215 220 

Pro Lys Gly He Asn Gly lie Asp Phe Lys Gly Glu Ala He Thr Phe 
225 230 235 240 

Lys Ala Thr Thr Ala Gly He Leu Ala Thr Leu Ser His Cys He Glu 
245 250 255 

Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys Arg Leu Asp Lys Glu 
260 265 270 

Thr Glu His Phe Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu 
275 280 285 

He Asn Glu Glu Glu Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arq 
290 295 300 

Gin Asp Lys He Glu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His 
305 310 315 320 

Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala Phe Ser Ser Val Gly Thr 
325 330 335 

His Arg Phe Val Gin Lys Pro Tyr Ser Arg Ser Ser Ser Met Ser Ser 
340 345 350 

He Asp Leu Val Ser Ala Ser Asp Asp Val His Arg Phe Ser Ser Gin 
355 360 365 

Val Glu Glu Met Val Gin Asn His Met Thr Tyr Ser Leu Gin Asp Val 
370 375 380 

Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu Met Lys 
385 390 395 ' 400 

Val Tyr Arg Arg Glu Val Glu Glu Asn Gly He Val Leu Asp Pro Leu 
405 410 415 

Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val Cys Asn 
420 425 430 

Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr He Glu 
435 440 445 

Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala He He He Tyr 
450 455 460 

Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr 
465 470 475 480 

Leu Ser Val He Arg Lys He Pro Ala Leu Thr Glu Asn Asp Pro Glu 
485 490 495 

Thr Trp He Val Cys Asn Phe Ser Val Asp His Asp Ser Ala Pro Leu 
500 505 510 



Asn Asn Arg Cys Val Arg Ala Lys He Asn Val Ala Met He Cys Gin 
515 520 525 
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Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu lie Ser Arg Asp Asn 
530 535 540 

He Leu Cys Lys He Thr Tyr Val Ala Asn Val Asn Pro Gly Gly Trp 
545 550 555 560 

Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr Pro Lys 
565 570 " 575 

Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys 
580 585 * 590 



Pro He Leu Phe 
595 



<210> 23 
<211> 2038 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPBP-D169A 

<220> 

<221> CDS 

<222> (10) . . (1920) 

<400> 23 

gaattcacc atg gcc cca eta gec gac tac aag gac gac gat gac aag atg 51 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met 
15 10 

teg gat aat cag age tgg aac teg teg ggc teg gag gag gat cca gag 99 
Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu 
15 20 25 30 

acg gag tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg 14 7 
Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp 
35 40 45 

aca aac tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat 195 
Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn 
50 55 60 

aat get ctg agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc 243 
Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys 
65 70 75 

aga gga tec ate tgt ctt age aag get gtc ate aca cct cac gat ttt 2 91 
Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe 
80 85 90 

gat gaa tgt cga ttt gat att agt gta aat gat agt gtt tgg tat ctt 339 
Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu 
95 100 105 ' 110 

cgt get cag gat cca gat cat aga cag caa tgg ata gat gcc att gaa 3 87 
Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu 
115 120 125 
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cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga 43 5 
Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg 
130 135 140 



cat ggc tea atg gtg tec ctg gtg tct gga gca agt ggc tac tct gca 
His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala 
145 150 155 



get gaa atg gaa aca ttt aga gec ate tta tgt aga caa gtt gac acg 
Ala Glu Met Glu Thr Phe Arg Ala lie Leu Cys Arg Gin Val Asp Thr 
175 180 185 190 



483 



aca tec acc tct tea ttc aag aaa ggc cac agt tta cgt gag aag ttg 531 
Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu 
160 165 170 



gca tat aaa aat gca atg aca gaa ctt aag aaa aaa tec cac ttt gga 
Ala Tyr Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe Gly 
305 310 315 

gga cca gat tat gaa gaa ggc cct aac agt ctg att aat gaa gaa gag 
Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu lie Asn Glu Glu Glu 
320 325 330 

ttc ttt gat get gtt gaa get get ctt gac aga caa gat aaa ata gaa 
Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He Glu 
335 340 345 350 

gaa cag tea cag agt gaa aag gtg aga tta cat tgg cct aca tec ttg 
Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser Leu 
355 360 365 

ccc tct gga gat gee ttt tct tct gtg ggg aca cat aga ttt gtc caa 



579 



eta cag aag tac ttt gat gee tgt get gat get gtc tct aag gat gaa 627 
Leu Gin Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu 
195 200 205 

ctt caa agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct aca 67 5 
Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr 
210 215 220 

acg cgt tct gat ggt gac ttc ttg cat agt acc aac ggc aat aaa gaa 723 
Thr Arg Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu 
225 230 235 

aag tta ttt cca cat gtg aca cca aaa gga att aat ggt ata gac ttt 771 
Lys Leu Phe Pro His Val Thr Pro Lys Gly He Asn Gly lie Asp Phe 
240 245 250 

aaa ggg gaa gcg ata act ttt aaa gca act act get gga ate ctt gca 819 
Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala 
255 260 265 270 

aca ctt tct cat tgt att gaa eta atg gtt aaa cgt gag gac age tgg 
Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser Trp 
275 280 285 

cag aag aga ctg gat aag gaa act gag aag aaa aga aga aca gag gaa 915 
Gin Lys Arg Leu Asp Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu Glu 
290 295 300 



867 



963 



1011 



1059 



1107 



1155 
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Pro ser Gly Asp Ala Phe Ser Ser val Gly Thr His Arg Phe Val Gin 
370 375 380 

aag ccc tat agt cgc tct tec tec atg tct tec att gat eta gtc agt 1203 

Lys Pro Tyr Ser Arg Ser Ser Ser Met Ser Ser lie Asp Leu Val Ser 

385 390 395 

gec tct gat gat gtt cac aga ttc age tec cag gtt gaa gag atg gtg 1251 

Ala Ser Asp Asp Val His Arg Phe Ser Ser Gin Val Glu Glu Met Val 

400 405 410 

cag aac cac atg act tac tea tta cag gat gta ggc gga gat gec aat 1299 

Gin Asn His Met Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn 

415 420 425 430 

tgg cag ttg gtt gta gaa gaa gga gaa atg aag gta tac aga aga gaa 1347 

Trp Gin Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu 

435 440 * 445 

gta gaa gaa aat ggg att gtt ctg gat cct tta aaa get ace cat gca 1395 

Val Glu Glu Asn Gly lie val Leu Asp Pro Leu Lys Ala Thr His Ala 
450 455 460 

gtt aaa ggc gtc aca gga cat gaa gtc tgc aat tat ttc tgg aat gtt 1443 

Val Lys Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val 

465 470 475 

gac gtt cgc aat gac tgg gaa aca act ata gaa aac ttt cat gtg gtg 1491 

Asp Val Arg Asn Asp Trp Glu Thr Thr lie Glu Asn Phe His val Val 

480 485 490 

gaa aca tta get gat aat gca ate ate att tat caa aca cac aag agg 153 9 

Glu Thr Leu Ala Asp Asn Ala He He lie Tyr Gin Thr His Lys Arg 

495 500 505 510 

gtg tgg cct get tct cag cga gac gta tta tat ctt tct gtc att cga 1587 

Val Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Val He Arg 

515 520 * 525 

aag ata cca gec ttg act gaa aat gac cct gaa act tgg ata gtt tgt 1635 

Lys He Pro Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val Cys 
530 535 540 

aat ttt tct gtg gat cat gac agt get cct eta aac aac cga tgt gtc 1683 

Asn Phe Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val 

545 550 555 

cgt gec aaa ata aat gtt get atg att tgt caa acc ttg gta age cca 1731 

Arg Ala Lys He Asn Val Ala Met He Cys Gin Thr Leu Val Ser Pro 

560 565 570 



cca gag gga aac cag gaa att age 
Pro Glu Gly Asn Gin Glu He Ser 
575 580 

aca tat gta get aat gtg aac cct 
Thr Tyr Val Ala Asn Val Asn Pro 
595 

tta agg gca gtg gca aag cga gag 
Leu Arg Ala Val Ala Lys Arg Glu 



agg gac aac att eta tgc aag att 1779 

Arg Asp Asn He Leu Cys Lys He 
585 590 

gga gga tgg gca cca gec tea gtg 1827 

Gly Gly Trp Ala Pro Ala Ser Val 

600 60S 

tat cct aaa ttt eta aaa cgt ttt 1875 

Tyr Pro Lys Phe Leu Lys Arg Phe 
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610 



615 



620 



act tct tac gtc caa gaa aaa act gca gga aag cct att ttg ttc 
Thr Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro lie Leu Phe 
625 630 635 



1920 



tagtattaac aggtactaga agatatgttt tatctttttt taactttatt tgactaatat 1980 
gactgtcaat actaaaattt agttgttgaa agtatttact atgttttttc cggaattc 203 8 



<210> 24 
<211> 637 
<212> PUT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPBP-D169A 
<400> 24 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met Ser Asp 
15 10 15 

Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu 
20 25 " 30 

Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
35 40 45 

Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
50 55 60 

Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
65 70 75 80 

Ser lie Cys Leu Ser Lys Ala Val lie Thr Pro His Asp Phe Asp Glu 
85 90 95 

Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
100 105 110 

Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu Gin His 
115 120 125 

Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
130 135 140 

Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala Thr Ser 
145 150 155 160 

Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu 
165 170 175 

Met Glu Thr Phe Arg Ala He Leu Cys Arg Gin Val Asp Thr Leu Gin 
180 185 190 

Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin 
195 200 205 

Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg 
210 215 220 
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Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu 
225 230 235 

Phe Pro His val Thr Pro Lys Gly He Asn Gly lie Asp Phe Lys Gly 
245 250 255 

Glu Aia lie Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala Thr Leu 
260 265 270 

Ser H<s Cys He Glu Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys 
275 280 285 

Arg Leu Asp Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu Glu Ala Tyr 
290 295 300 

Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe Gly Gly Pro 
305 310 315 320 

Asp Tyr Glu Glu Gly Pro Asn Ser Leu lie Asn Glu Glu Glu Phe Phe 
325 33 ° 335 

Asn Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys lie Glu Glu Gin 
340 545 350 

Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser 
355 360 365 

Glv Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val Gin Lys Pro 
370 3 75 3 80 

Tyr Ser Arg Ser Ser Ser Met Ser Ser He Asp Leu Val Ser Ala Ser 
365 " 39° 395 

Asp Asp Val His Arg Phe Ser Ser Gin Val Glu Glu Met Val Gin Asn 
405 410 415 

His Met Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin 
420 425 430 

Leu Val val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu 
435 440 445 

Glu Asn Gly He Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys 
450 455 460 

Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val 
465 470 475 480 

Arg Asn Asp Trp Glu Thr Thr lie Glu Asn Phe His Val Val Glu Thr 

Leu Ala Asp Asn Ala lie He He Tyr Gin Thr His Lys Arg Val Trp 
500 505 510 

Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Val He Arg Lys He 

515 520 525 

Pro Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val Cys Asn Phe 
530 535 S40 
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Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala 
545 550 555 560 

Lys lie Asn Val Ala Met He Cys Gin Thr Leu Val Ser Pro Pro Glu 
565 570 575 

Gly Asn Gin Glu He Ser Arg Asp Asn He Leu Cys Lys He Thr Tyr 
580 565 590 

Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg 
595 600 605 

Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser 
610 615 620 

Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
625 630 635 



<210> 25 
<211> 12482 
<212> DNA 

<213> Homo sapiens 
<400> 25 

tcgatcattt ccctcttcat attcagtgta tattgcacag atctctcaac aacacagcca 60 
ttaaatagat attctccaag tgacacttac atcacacatg tttgagttta cgttacttgc 120 
aaacataggg aaagaaagat acatgggata aactggtgca tgagaaatga gatcttagca 180 
gttggttgaa ataaatgaga acaactgagg caaactaaag aggaagaagg gcaagtggca 240 
gcttaacagg agtaagatga tgagatgaag ggcagaatac cttcatggag aggaggcaaa 300 
gagatataca tgatatgttc ttaggaacat aactgaagca aacaatgata tcatttctaa 360 
ttatatataa acctgtgagt cagccttcca ggggcggcct gctaaggtag aatcattgga 420 
atgacttggc cagggtttgg ataggagaga attggcagca gcgttaagat tgacccatga 480 
taaataatgc tatgcaggca gcagggagtc tgactaggag caaaatcaac gaacttatcc 540 
cttgcctaac atagtatctg tggagtcaga aagaagaggt taaattggga tatctgaggc 600 
aagtatcagg atttgccatg tctgcggagt agtttcataa ttctaatggt tataagcact 660 
aaggcgttca ctaagtgaat gttggtagtt ccaggttata ttatccattc ttgagttaca 720 
aaatacactt taaaaccttc ccatcttaat attatatgtt tttttagtca cagagtgaaa 780 
aggtgagatt acattggcct acatccttgc cctctggaga tgccttttct tctgtgggga 840 
cacatagatt tgtccaaaag gtaagctaat gtcagagttt actaaaagta caccttgtat 900 
tgttcttcat tgfctggfcgga aatatctttt attcgagacg gagtctcact ctgtcaccag 960 
agtggagtgc agtggcgcga tctcggctca ctacagtctc cacctcccgg gttcaagaga 1020 
ttctcgtgcc tcagcctccc tggtagctgg gattacaggc atgtaccacc acacccagct 1080 



55 



WO 00/50607 PCT/IBOO/00324 

aatttttgta tttttaatgg agacagtttc accacggcca ggacggcctt gatctcctga 114 0 
ccttgtgatc cacccacctc agcctcccag agtgctggga ttacaggcgt gagccaccat 12 00 
gcccagccgg aaatatcttg tagtacataa gttttctccc cttttcatta atttaagtaa 1260 
tgagactgtt tttggtttta tatattgtat tccatataca tcctccaaaa cagttagaaa 1320 
ttttgttctg aaaataaagt tctttcattt ttatttaagg ggaaagttgg gggtgggcaa 13 80 
ataaggagtg gctagtccaa aatagttaac cagaagtata tccagttata ctaaatctct 144 0 
ctcttctttg gggttaaatg gtattacttt gtattattgg aagcactaca ttcttttttg 1500 
gaatgatttt ggaacacaat acataatagg tgcatgaagt cagcagttgc tgctgtgctt 1560 
gtttcatata gtgctttgtt ttctcttccc tttatcttgt gtttggaagt tggtactgaa 1620 
tgctctgctg tgcctttgtt ctgattactt ggttttttct ttgtctgtct ccggtagccc 1680 
catagtcgct cttcctccat gtcttccatt gatctagtca gtgcctctga tgatgttcac 1740 
agattcagct cccaggtact gtatgaatgt atagagtgga cttgagtctt tctgtgctat 1800 
atttcagcct gctttcccag ttcctagaaa tcfctttggtt aggccactga ttttagtttt 1860 
gaattttaaa tagtaacatt aagcattaaa aaggtcttcc ttgtctacta aatagttcct 1920 
ctgtcaggtt tgcatgtgtc ctttactatt cacagcttgg aattctgtca tataggaggt 1980 
actccagaaa gattttcaaa ctgaattgaa acaaatagaa gatactgggt tttgtatatc 2040 
atgtaatatc tgtttcttca gtcaggattt agcagttttg atggacgtgg tccatatgat 2100 
atgttatagc agaaaagcag atttttacaa gtctcacttt aaagcctaaa gtacccccaa 216 0 
ttaatattca acaaggaaat cactttttaa taatatgttt catttccatt ataatactaa 2220 
gctctattga gcagattgtg ttttccttat gcaaactacc tttggatatt ataaatgaat 2280 
atttctgttc atatgctaaa tctatggaaa tttgttttaa tttttagcat tggtaagggt 234 0 
ttaggaattt aagacaggaa gctggatgct tgcggtctct aaagtctgta ccctcaaaat 2400 
aaaatcagat taccattgga agaagttttt tttagtgtca gcgttagttc tttttttaat 2460 
tttcttaatc ttcacatctt tgccattcaa ctttttatcC ttctggtgat tgcattttat 2520 
tggactagat tatattatgt taatcttata ttaaagacct gagcactctg gtcagaatga 2580 
ctcagtttaa accctggtta ggtgtatgat cccagtaagt tttctaactt ttttgtgctt 2640 
catttttatg atttagctag aacctgacac ataataagtg ctcaataaat gttaccttgt 2700 
attgctatta taacataatt tctttgagct aataaaagtt atctacatca ttattttttc 2760 
ctctgtgaga gtattgctat aaaagttttt aaaagtcata gtttaaagag atttctatta 2820 
tttttatgtt tataaataaa gtttacatta gtttttaacc tgcaatagag aagaatatta 2B80 
agactttaat ttttccgact cgtacagcgt ttttctccct gaacactctt aagaaaaaga 2940 
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tttagcaatt ctggaccaga aatcatccat 
ttgcttgtcc atttatgcat ttttttttaa 
tgctctgttg cccaggctgg agtgcagtgg 
cccaggttca agcaattctc ctgcctcagc 
ccactatgcc cagctaattt ttgtattctt 
ggctggtcta gcactcctga cctcgtgatc 
ttacaggtgt gagccaccat gcccggccct 
ataatgaatc tgtgacaaaa ctacataata 
ccacattgca ctaaaatcat atcacagact 
aatgaaagaa aatataatga cagcaaagaa 
agaattaggg gttcagaacc caaaccaggg 
ttgcttttaa gtccaggtaa ctatatcact 
tttgctgaca gcttacttct tcgtaatatt 
cttgagagaa atcttgctgg atttttttct 
cctgttttag cctttgatta tgtactgttt 
ataacattaa ggtcttcaag tatttgttgt 
acgtggactc tagcaattca ctgttgtaat 
tttataatag atgactaatt cccaattgat 
gccttatcag ccattatcaa aggatctagc 
tatagcattg gatgtagatc tggagttaat 
aactcttcca gttcaacaca gtgtaaaatt 
ttacagttct cacaccctat agagactttt 
ttcattaaca tcagaaacac ttatgttata 
tttttttttt ttttttgaga cggagtttcg 
acgatcttgg ctcactgcac cctctgcccc 
acctgagtag ctgggattac aggcacctgc 
tagtagagat ggggtttcac catgttggcc 
atctgcccgc ctcggcctcc caaagcgctg 
ctacaccgtt cttaatacac aaatatacat 
taaccaaatt ctttgtttta taatatcttc 



aaccaaatat accacagtat attttaccrt 3000 
ttttacttat ttattttcga gacagggtct 3060 
cacgatctgg gctcactgca acctccatct 3120 
ctcccaagta gctgggatta caggcacgca 3180 
agtaaagacg ggttttcacc atgttggcca 3240 
tgcccacctc ggcctcccaa agtgctggga 3 300 
gcgtatgttt ttaaaaagag actcatattc 3 360 
ctgggagact ttggtttatt gtgctaagct 3420 
aatcaaaaat gcaggaatac ataggctata 3480 
agaatgtaag ccagtaataa agaatgccca 354 0 
ccctcactgt agtgctgtag aacagctgaa 3600 
gagaagcagg tgcctatatt tttacaaaat 3660 
aatacccttt tgtaaaactc atgtatgtaa 3 72 0 
ctaatatatg gtgctcatga ttgatcagat 3780 
tatatgccag aagaggtaaa aatgaagaaa 3B40 
ccttgctaaa gcattagttg tcattagcag 3 900 
taaattgtgt gccttatgtt cagcagttcc 3 960 
aagatttttt gtttcagagg atgttacact 4020 
aagttgattc tgtatagtca cacttgagaa 4080 
attagttgag aaacattgtg ttatctggaa 4140 
atagtagtga ctacacagta gtgttacatt 4200 
gtattaacaa aataagaggc tcaaaggtta 4260 
ttacattgca tcggtctttt ctgttttttg 4320 
cttttgttgc ccaggctgga gtgcaatggt 43 80 
ctggattcaa gcgattctct tgcctcagcc 4440 
caccacaccc agctaatttt ttttcatttt 4500 
aggctggtct cgaactcctg acctcaggtg 4560 
ggattacagg catgagccgc cacacctggc 4620 
cagttactcc acagcgcttg atatgggagg 4680 
ataattaatt aaaaaactaa gtcgacattt 4740 
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ttaatcacct 


ttaataattt 


gccaaaatat 


tatataagca 


taatataatc 


aattcttact 


4800 


tactccaaca 


aattttaaaa 


gtccagatac 


agataccata 


tctagtttct 


tgatcattta 


4860 


tatcagctcc 


catacagaag 


ccttctaaat 


ctctggtaat 


ttcactttgc 


tgtttatata 


4920 


ay eg c uyg CI 


catgac t acc 


ttgttcttct 


tgaaatgatg 


tttcatagcc 


ttgaattggc 


4980 




aagugtacaa 


ttgagagatg 


ccctgaaaac 


agcttaaaat 


aaaatatgta 


5040 


catctactay 


gaaauuagua 


ccaacacatg 


aatctgtctg 


atgggcagat 


attaggaatg 


5100 


aagtcactcc 


agat ctgaga 


aat taaagtt 


gtaaaggact 


gcaagttctg 


tgtttttgtt 


5160 


gttgttgtcg 


tcgttgttgt 


tgtttgtttt 


ttcatttttg 


ttttttgggt 


ttttttgaga 


5220 


cagagtctca 


ttctgtcacc 


caggctgtag 


tgcagtggca 


cgatctcaac 


tcactgcaac 


5280 


ctccgtctcc 


caggttcaag 


cgattctcct 


gtctcagctg 


ggattacagg 


cacacgctat 


5340 


cacacccagc 


taatttttgt 


atttttagta 


gagacagggt 


ttcaccatgt 


tagccaggct 


5400 


ggtctcgaac 


tcctgacctc 


aagtgatctg 


cccgtctcgg 


cctcccaaag 


tgctgggatt 


5460 


acaggcctga 


gacaccatgc 


ccagcatttt 


tttttttttt 


tttttttttt 


gtaaagagac 


5520 


aaggtttcac 


ttgtccaggc 


caagtgcagt 


ggcatgatca 


tagctctgta 


acctgacctc 


5580 


tgacctctga 


cttcctggac 


acaagtgatc 


ctcctgtctc 


tcagcctccc 


aagtagctgg 


5640 


gactacaggc 


attccaccac 


acccaactaa 


ttgtttttat 


tttttgtaga 


gacagggcct 


5700 


tgctatgttg 


cccaggctgg 


caagttcttg 


aaataatggc 


tgtggccaca 


aactagaaaa 


5760 


taattttcag 


gtgtacagag 


aatagaaaga 


atttagattc 


ataaattgat 


cattttgttc 


5820 


acagttattt 


gcataacaca 


gttcacattt 


aaaggtgtca 


ccttagaaat 


caaaggggaa 


5880 


gaacatcatc 


ctctattgaa 


aaagaaagaa 


atcaaaggat 


gtacagtgaa 


tttgcagctt 


5940 


aatctatggg 


gagcatcatt 


gcaaaaaatg 


gttctgtgtg 


aggctctttc 


ccaccctttg 


6000 


tccataggag 


cacattattg 


ttgtagtaat 


tatttcaccc 


ctctcccttt 


ttcagtgtac 


6060 


aagtgataca 


tgctaatttt 


aacagaactt 


gaaagtagaa 


taaaattaaa 


ataatagttt 


6120 


actaatattc 


catttatctt 


ctctcatata 


tatgagataa 


atattaaggt 


gtatgtactt 


6180 


atccatatgt 


gcctgatttt 


tcaaaatcct 


tgtatatgca 


tctttgcacc 


cttatctaat 


6240 


tatttcctta 


gaatatattc 


ctagaagcat 


aattgtggga 


acaaaggcca 


tgaacatttt 


6300 


caagtgttta 


ttttattatt 


ttattttatt 


tttattaatt 


ttgatacagg 


gttttgcttt 


6360 


gttccccaga 


ctggagtgca 


gtggtgagat 


caccactcac 


tgcaccttga 


cctcctggac 


6420 


tcaagcgatc 


cacctgcctc 


agtctcctca 


gtagcggggg 


ctaaggacta 


caggcacatg 


6480 


ccatcatgcc 


cagctaattt 


ttttatttgt 


agcagagacg 


aggtctcact 


gtgttgccca 


6540 


ggctgctatt 


ttatttattt 


tttaagagat 


agggtctcat 


tctgtcttcc 


aggctagaat 


6600 
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gcagtggcac aatcatagct cactgcaacc tcaagcgatc tttgcctcag cctgagtagc 6660 
tgggactaca ggcatgggcc accactctca gctaattttt ttttcaattt tttatttttt 6720 
gtagatatgg gggtctcact gtgttgccta ggctggtctt gaacccctag cctaaagtga 6780 
tcttcccacc tcagcctccc aaagtgctag gattacaggc cacaggcctc agccaagttt 6 840 
taaaaatttt tactgccaaa ctcttcatta gaaaagttga accagcttac attcccaggc 6900 
cagttttcta ttgatatagt agcactgaat attataattc agttaacttt tgtcaatacg 6960 
gtaggctaaa agtgctatgt tcttagccat ctctcttttg ggttaacagt gcactatttt 7020 
gttattaata attattctat ctaacaagcc ccctctatgg ttttgtggct ttgtagtaag 7080 
catagttgta tttccttttt tgaggtggag tcttgctatg ttgcccaggc tggagtgcag 7140 
tggcgcgatc tcggctcact gcaccctccg cctcccgggt tcaagtgatt ctcctgcctc 7200 
agactcctga gtatctggga ctacaggcat gcaccaccac gcccagctaa ttttttatat 7260 
ttttagtaga gaggggagtt caccgtgtta gccgggatgg tctctatctc ttgacctcgt 7320 
ggtccgcgtg cctcagcctc ccaaaatgct gtgattacag gcatgagcca ccctgcctgg 7380 
ccaacatttc ttttacatgc ataaaagaga tctgagctgt ttttgagccc ttctagactt 7440 
tctttttttt tttttttttt tttttttttt tttttttttt tcttttttaa gtagatgagg 7500 
tcttgctatg ttgccgagac ttaacctcaa actcctaggc ccaagcaatc ctcccaagct 7 560 
gctgggacta caggcatgaa ccaccatgcc caacttagac ttttattgta ctatcaaaag 7620 
gcaattttct tttcaaattt ctgggtaata gtgttagaaa aatcctactt ggtaacatcc 7680 
agaaatggca tcatactgag tgattcaaat gtgagatgga agaaaaggtt agaattggag 7740 
tgaacgtccc ctcttatctc aaatgtattt tatctccatt ttgtttcata gtttattagt 7800 
ttgaagatgc tttgaatgtc acctaatcat tttcaactct aggtccagaa aaatcaaggg 7860 
catgatttct gaaattacac ttagcctaat taaaacttag aaacactgtt caccttcttc 7920 
aatgtttttg actgagtctt tttcatttat aagtgacagg aggtgttact ataacattat 7980 
ttcctagaat gtcaaatttt gagcctaata gcatggtaaa tttggctata tttgttgttt 8040 
tttgtttttg tttttttttt aatgaaactt agtatttcct tgtttcccac ttcttttttt 8100 
tttttttttt tttttttttt tgagacggag tctctctctg tcatccaggc tggagtgcaa 8160 
tggcgtgatc ttggctcact gccacctccg cctcgcaggt tcacgctatt ctcctttcac 8220 
agcctcctga gtagctggga ctacaggcac ccaccaccac gcccggccaa tttttttgta 8280 
tttttagtag agacggggtt ttaccatgtt aggcaggatg gtctcgaact cctgaccttg 8 340 
tgatctgccc gcctcagcct cccaaagtgc tgggattaca ggcgtgagcc accgcacctg 8400 
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gcctcccact tctttttaat atgtcgtgtc acaaccgaac agtaaagtga gcagattatc 8460 
aggttaaatc tgaagtgtca gtctggtcac cagtgcccaa gttactgccc ctatggtaat 8520 
attggttact ttgtattttc ctacagcaaa cataaaattt gttatagtga gatttttacc S5 80 
tgtatacctc tcttaacttt aatgttatta cctcaaggaa gatattatca tgaatgaaga 8640 
ttccatgatg aaagttttgc agagtttatt gcagtaattt agcacttcat tagaatcttt 87 00 
agttttttag gagcacagta ctgaatgctt gtttctttgt tggacctttt gaaaaccggt 87S0 
tttccattga tgcagtgtag ctgttacagg aatatcattt ttaaaacgtt tttatacagc 8820 
atggctgaaa attgaacctg ggcctccctc gtggcctacc attgaaggaa cagcattttt 88 80 
tgcctatcta gaaagacaat gttaaatgtg ctatctatat attttttaac ttgtgctacc 8940 
tactacgcgt ttatatttgt ggaatctgtt ttcttttgga caaaaccaca aatcaaaaac 9000 
acctcatttc ttaggcattt gaaatcccta attcagaata atctcccaaa cagaaacaca 9060 
actacctgca ttctttttga caaaagagct aagtagcatt agaaaattat tttaaaccca 9120 
attctgtttt ttaacagaat aaaattcttc tgttcttcac attcttcttt cataggtaac 9180 
ctattgaaag tagggtttat ttgggggaag catttctttc tgtctcttat ctcataataa 9240 
atacaggtgt gcttaactac tagtttccta cctcaaagat atactcaaat ctaaagatgt 93 00 
ttaagatttt gggatctgaa gagtaaacat ttctcctaat cacaatgtga cagagacaaa 9360 
tgaatcaagc caatgctact tttatttatg catactaact ggaacttttc tttttggaaa 9420 
tcagatacat tttgcatgta ttagtaattt ggaaccctgc attggttatc ctcgccctcc 9480 
caaagcagat tctgaaatta taaaggtgca caggttctcc atgcaacacc aaaagttata 954 0 
ttttccaagg ctttgtaaaa ttgtagaatg tcctgttaaa tttctgtcaa atcagtaact 9600 
cacaccgtcc tgagaattat gaataaagga ataaaatatt gttagtgttt atttagtaca 9660 
aaagtagatt atagaatctc agcatttttg tcaaaaaatt tctttttgat gattgacaga 9720 
tcaggagaca cttaaggcca tacctgcttt cagtaatcaa aaatgcattt aagatccaga 9780 
aacttgaggt agcagaacac cactatcaca tataacatat cctttggtat agaaaattat 9840 
attcccagag tgagtttctt ttttaaaacc attaatgagg ccaaggtggg aagatcactt 9900 
gggaccagga gttcaagacc aagcctgggc cagatggcga gaccctgtct ctacaaaaaa 9960 
ttaactggat gtggtggtgc actcctgtag tcccacctac tcagaggctg aggcaggagg 10 02 0 
accccttgag cccaggaaat tgtagtggca gtgagctatg atcatactac tgtactgcag 10080 
tctgggccac gaagtgagac cgtgtctctt aaaaaaaaaa aaatgttagg catggtggca 10140 
caggcatata gttttagcta cttaggaggc tgaggcagga ggatcacttg agcccagaag 10200 
ttcaagatta cagtgagtta tgattgtgcc gctgcactcc aacctgggtg acaaaataac 10260 
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cctgtctctg 


gcgggtaggg 


gggaagttga 


ttatttactt 


tgaaatatgt 


tcaaaactga 


10320 


ttcctgttct 


atattcctaa 


tgaacagaat 


agactttata 


taaaacaaat 


agttaaactt 


10380 


aaggataaaa 


ttttaatgga 


agtataatat 


atatatcttc 


cagctcttct 


gtcttctaac 


10440 


gtatttatta 


cagaaaatga 


aattactttg 


tttccgcaat 


ctttgtatca 


cttcagttct 


10500 


ccaataaatc 


cgagaattct 


ggtagtgtga 


aatattcagc 


tttctttgct 


tatttacata 


10560 


aaatgtataa 


ggacaatttg 


tgataattaa 


gagttacatt 


taaatatcag 


gaaaaagtta 


10620 


taaatttaaa 


ttaaaaaatt 


ttaaaaggaa 


attattagaa 


attttaaaag 


aatgaactaa 


10680 


aaggtgacta 


tatgtaaatg 


cttgcatata 


tgaatattag 


cattgtcccc 


aaaataattt 


10740 


agaacaaaga 


aattggaatc 


aaataaataa 


aggtttgatt 


atttttaaat 


tggcttatat 


10800 


tccatgataa 


aagagaggtt 


tatcagtggc 


ataagaaagg 


tttttcacct 


tttttgtatt 


10860 


gaaatctttg 


acatatacat 


atatatcttt 


gctcatcttt 


gtgtatcttt 


gctcgtatga 


10920 


gagcaaagat 


ataggcaaag 


atatgctctc 


tctctctatg 


tctttgttca 


taccaagacc 


10980 


ttcctgatat 


ctccacataa 


tcttaaatat 


aggaacatta 


gactggatga 


tctctgtgcc 


11040 


ccctttatct 


ctactcttcc 


attattttat 


actttaacac 


atcatctctg 


ttttatgata 


11100 


taagaatgga 


atatttcttt 


tttcctgaaa 


afcgcttattt 


tggtcacttg 


atacacatta 


11160 


ggccaatatg 


tgttacttga 


gtgacccatc 


ttccttcctt 


tcatttctgt 


ctcctgtcat 


11220 


taacctggat 


atctggaatg 


tggactaaac 


tcttcaaaca 


ctatgtaaaa 


cctactaacc 


11280 


tttgtgcatt 


tggttgctca 


gctactaaga 


gcaccatttc 


tgaactgaag 


ttaactgaag 


11340 


accattctgt 


tttagagatt 


atgacatacc 


ttttggattc 


tcatgccttt 


ttcctccctt 


11400 


ctcaaggttg 


aagagatggt 


gcagaaccac 


atgacttact 


cattacagga 


tgtaggcgga 


11460 


gatgccaatt 


ggcagttggt 


tgtagaagaa 


ggagaaatga 


aggtaattcc 


ccctgaaatg 


11520 


ttatagattg 


ccaaaggcgt 


ctctgtttca 


gtcatattat 


cattactatt 


gatatgaata 


11580 


aggatagcac 


tttcaactta 


cctttaaaac 


aaattattac 


atgtgatcaa 


agcagtacca 


11640 


tatattgagc 


aataaaatgt 


ctttttgctt 


ttctggcttt 


gcctttacta 


aaggttttta 


11700 


tgattataat 


ataaatatat 


gattaaacct 


ttctgttttg 


actaggccat 


gaagaaaata 


11760 


aaatttagag 


aattagatat 


gaccaggtca 


caattagctg 


atggtcctgt 


atttggatat 


11820 


ttccttttgt 


tttgtttttt 


taacatactg 


aatgttgtgc 


ctagatgaca 


ctttgtttct 


11880 


ctcccttttt 


ggtctatacc 


ctccttcttt 


tcccttctct 


tactgcacct 


tcaattgata 


11940 


cttggacact 


ggtcagttaa 


tcctggttac 


atccctaaac 


acatggacag 


aaaataagag 


12000 


cagggactga 


gagatacaga 


gatggattga 


aaagcaaaag 


caacattgaa 


ttttggattt 


12060 
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tctcattcct aaggaactat gctaaataaa gatacaaaga taataagaca ctctccaagc 1212 0 

taaagcttta gttaaggaaa aagaatactg acatttaaaa gatactattg gccaggcaca 12180 

gtggctatgc ctgtaatccc agcactttta ggaggacatg gcaggcggat tacttgagct 12240 

caggagttca agtcaaacct gggcaacacg gtgaaacccc gtctctacca aaaatacaaa 12300 

aattagctgg gtgcagtacc acacacttgt agtcccagct acccaggagg ctgggcaaaa 12360 

gattccttga gccagggagg tcaaggctgc aatgagccgc gtttgtgcca ctgcactcta 12420 

gcctgggtca caaagtgaga ccctgtgtga gatatatata tatatatata tatatatata 12480 

ta 12482 



<210> 26 
<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: GPpepl 
<400> 26 

Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr Thr Arg 
15 10 15 

Gly Phe Val Phe Thr 
20 



<210> 27 
<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPpeplAla9 
•<400> 27 

Lys Gly Lys Arg Gly Asp Ala Gly Ser Pro Ala Thr Trp Thr Thr Arg 
15 10 15 

Gly Phe Val Phe Thr 
20 



<210> 28 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-54m 
<400> 28 

tcgaattcac catggcccca ctagccgact acaaggacga cgatgacaag 50 
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<210> 29 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-55c 
<400> 29 

ccgagcccga cgagttccag ctctgattat ccgacatctt gtcatcgtcg 50 

<210> 30 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 0N-HNC-B-N-14m 
<400> 30 

cgggatccgc tagctaagcc aggcaaggat gg 32 

<210> 31 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-HNC-B-N-16C 
<400> 31 

cgggatccat gcataaatag cagttctgct gt 32 

<210> 32 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG peptide 
<400> 32 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 



<210> 33 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Hypothetical 
peptide 

<400> 33 

Pro Arg Ser Ala Arg Cys Gin Ala Arg Arg Arg Arg Gly Gly Arg Thr 
1 5 10 15 
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Ser. Ser 



<210> 34 
<:211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-llm 
<400> 34 

gcgggactca gcggccggat tttct 



<210> 35 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<:223> Description of Artificial Sequence: ON-GPBP-l5m 
<400> 35 

acagctggca gaagagac 



<210> 36 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-20C 

<400> 36 

catgggtagc ttttaaag 

<210> 37 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of /Artificial Sequence: ON-GPBP-22m 
<400> 37 

tagaagaaca gtcacagagt gaaaagg 

<210> 38 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-53c 
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<400> 38 

gaattcgaac aaaataggct ttc 



23 



<210> 39 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-56m 
<400> 39 

ccctatagtc gctcttc 17 



<210> 40 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-57C 



<210> 41 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-62C 
<400> 41 

gtggttctgc accatctctt caac 24 



<210> 42 

<211> 41 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-26 



<210> 43 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPIII derived 
peptide 



<400> 40 

ctgggagctg aatctgt 



17 



<400> 42 

cacatagatt tgtccaaaag gttgaagaga tggtgcagaa c 



41 
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<400> 43 

Gin Arg Ala His Gly Gin Asp Leu Asp Ala Leu Phe Val Lys Val Leu 
15 10 15 

Arg Ser Pro 



<210> 44 

<211> 14 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPIII-IV-V 
derived peptide 

<400> 44 

Gin Arg Ala His Gly Gin Asp Leu Glu Ser Leu Phe His Gin 
15 10 



<210> 45 
<211> 685 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDV 

<220> 

<221> CDS 

<222> (1) . . (633) 

<:400> 45 

ggt ttg aaa gga aaa cgt gga gac agt gga tea cct gca acc tgg aca 48 
Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
15 10 15 

acg aga ggc ttt gtc ttc acc cga cac agt caa acc aca gca att cct 96 
Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala He Pro 
20 25 30 

tea tgt cca gag ggg aca gtg cca etc tac agt ggg ttt tct ttt ctt 144 
Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

ttt gta caa gga aat caa cga gec cac gga caa gac ctt gga act ctt 192 
Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Gly Thr Leu 
50 55 60 

ggc age tgc ctg cag cga ttt acc aca atg cca ttc tta ttc tgc aat 240 
Gly Ser Cys Leu Gin Arg Phe Thr Thr Met Pro Phe Leu Phe Cys Asn 
65 70 75 80 

gtc aat gat gta tgt aat ttt gca tct cga aat gat tat tea tac tgg 288 
Val Asn Asp Val Cys Asn Phe Ala Ser Arg Asn Asp Tyr Ser Tyr Trp 
85 90 95 

ctg tea aca cca get ctg atg cca atg aac atg get ccc att act ggc 336 
Leu Ser Thr Pro Ala Leu Met Pro Met Asn Met Ala Pro He Thr Gly 
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100 105 110 

aga gcc ctt gag cct tat ata age aga tgc act gtt tgt gaa ggt cct 3 84 

Arg Ala Leu Glu Pro Tyr He Ser Arg Cys Thr Val Cys Glu Gly Pro 
115 120 125 

gcg ate gcc ata gcc gtt cac age caa acc act gac att cct cca tgt 432 

Ala He Ala He Ala Val His Ser Gin Thr Thr Asp He Pro Pro Cys 
130 135 UO 

cct cac ggc tgg att tct etc tgg aaa gga ttt tea ttc ate atg aaa 480 

Pro His Gly Trp He Ser Leu Trp Lys Gly Phe Ser Phe He Met Lys 
145 150 155 160 

gcc tat tec ate aac tgt gaa age tgg gga att aga aaa aat aat aag 528 

Ala Tyr Ser He Asn Cys Glu Ser Trp Gly He Arg Lys Asn Asn Lys 
165 170 * 175 

teg ctg tea ggt gtg cat gaa gaa aag aca ctg aag eta aaa aag aca 576 

Ser Leu Ser Gly Val His Glu Glu Lys Thr Leu Lys Leu Lys Lys Thr 

180 185 * 190 

gca gaa ctg eta ttt ttc ate eta aag aac aaa gta atg aca gaa cat 624 

Ala Glu Leu Leu Phe Phe He Leu Lys Asn Lys Val Met Thr Glu His 
195 200 205 

get gtt att taggtatttt tctttaacca aacaatattg ctccatgatg 673 
Ala Val He 
210 

acttagtaca aa 685 



<210> 46 
<211> 211 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDV 
<400> 46 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
15 10 15 

Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala He Pro 
20 25 30 

Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Gly Thr Leu 
50 55 60 

Gly Ser Cys Leu Gin Arg Phe Thr Thr Met Pro Phe Leu Phe Cys Asn 
65 70 75 80 

Val Asn Asp Val Cys Asn Phe Ala Ser Arg Asn Asp Tyr Ser Tyr Trp 
85 90 95 

Leu Ser Thr Pro Ala Leu Met Pro Met Asn Met Ala Pro He Thr Gly 
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100 



105 



110 



Arg Ala Leu Glu Pro Tyr He Ser Arg Cys Thr Val Cys Glu Gly Pro 

115 120 125 

Ala He Ala He Ala Val His Ser Gin Thr Thr Asp He Pro Pro Cys 
130 135 140 

Pro His Gly Trp He Ser Leu Trp Lys Gly Phe Ser Phe He Met Lys 

145 150 155 160 

Ala Tyr Ser He Asn Cys Glu Ser Trp Gly He Arg Lys Asn Asn Lys 

165 170 175 

Ser Leu Ser Gly Val His Glu Glu Lys Thr Leu Lys Leu Lys Lys Thr 

180 185 190 

Ala Glu Leu Leu Phe Phe He Leu Lys Asn Lys Val Met Thr Glu His 

195 200 205 



Ala Val He 
210 



<210> 47 
<211> 680 
<212> DNA 

<213> Artificial Sequence 
<220> 

<222> Description of Artificial Sequence: GPDXII 

<220> 

<221> CDS 

<222> (1) . . <216) 

<400> 47 

ggt ttg aaa gga aaa cgt gga gac agt gga tea cct gca acc tgg aca 48 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
15 10 15 



acg aga ggc 
Thr Arg Gly 



tea tgt 
Ser Cys 



ttt gta caa 
Phe Val Gin 
50 



ttt gtc ttc 
Phe val Phe 
20 



acc cga cac 
Thr Arg His 
25 



gga caa gac 
Gly Gin Asp 
60 



ctt gat 
Leu Asp 



cct 96 
Pro 



ctt 144 
Leu 



ctg 192 
Leu 



agt caa acc aca gca att 
Ser Gin Thr Thr Ala He 

30 

cca gag ggg aca gtg cca etc tac agt ggg ttt tct ttt 
Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe 
35 40 45 

gga aat caa cga gec cac gga caa gac ctt gat gca 
Gly Asn Gin Arg Ala His Gly Gin Asp Leu Asp Ala 
55 60 



ttt gtg aag gtc ctg cga teg cca tagcegttea cagccaaacc actgacattc 246 
Phe Val Lys Val Leu Arg Ser Pro 
65 70 

ctccatgtcc teaeggctgg atttctctct ggaaaggatt ttcattcatc atgttcacaa 306 

gtgcaggttc tgagggcacc gggcaagcac tggcctcccc tggctcctgc ctggaagaat 366 
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tccgagccag cccatttcta gaatgtcatg gaagaggaac gtgcaactac tattcaaatt 426 

cctacagttt ctggctggct tcattaaacc cagaaagaat gttcagaaag cctattccat 4 86 

caactgtgaa agctggggaa ttagaaaaaa taataagtcg ctgtcaggtg tgcatgaaga 546 

aaagacactg aagctaaaaa agacagcaga actgctattt ttcatcctaa agaacaaagt 606 

aatgacagaa catgctgtta tttaggtatt tttctttaac caaacaatat tgctccatga 666 
tgacttagta caaa 6 BO 



<210> 48 
<211> 72 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDIII 
<400> 48 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
15 10 15 

Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala He Pro 
20 25 30 

Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Asp Ala Leu 
50 55 60 

Phe Val Lys Val Leu Arg Ser Pro 
65 70 



<210> 49 

<211> 392 

<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDIII-IV-V 

<220> 

<221> CDS 

<222> (1) . . (207) 

<400> 49 

ggt ttg aaa gga aaa cgt gga gac agt gga tea cct gca acc tgg aca 48 
Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
15 10 15 

acg aga ggc ttt gtc ttc acc cga cac agt caa acc aca gca att cct 96 
Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala He Pro 
20 25 30 

tea tgt cca gag ggg aca gtg cca etc tac agt ggg ttt tct ttt ctt 144 
Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
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35 

ttt gta caa 
Phe Val Gin 
50 

ttc cat caa 
Phe His Gin 
65 

gtgcatgaag aaaagacact gaagctaaaa aagacagcag aactgctatt tttcatccta 307 
aagaacaaag taatgacaga acatgctgtt atttaggtat ttttctttaa ccaaacaata 367 
ttgctccatg atgacttagt acaaa 392 



<210> 50 

<211> 68 

<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: GPDIII-IV-V 

<400> 50 
Gly Leu Lys Gly 
l 

Thr Arg Gly Phe 
20 

Ser cys Pro Glu 
35 

Phe Val Gin Gly 
50 

Phe His Gin Leu 
65 



<210> 51 

c211> 507 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: GPDIII-V 

<220> 

<221> CDS 

<222> (1) . . (216) 

<400> 51 

ggt ttg aaa gga aaa cgt gga gac agt gga tea cct gca acc tgg aca 4 8 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
15 10 15 

acg aga ggc ttt gtc ttc acc cga cac agt caa acc aca gca att cct 96 
Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala He Pro 



40 45 

gga aat caa cga gec cac gga caa gac cct gaa age eta 192 
Gly Asn Gin Arg Ala His Gly Gin Asp Leu Glu Ser Leu 
55 60 

ctg tga aagctgggga attagaaaaa ataataagtc gctgtcaggt 247 
Leu 



Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
5 10 15 

Val Phe Thr Arg His Ser Gin Thr Thr Ala He Pro 
25 30 

Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
40 45 



Asn Gin Arg Ala His Gly Gin Asp Leu Glu Ser Leu 
55 60 
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20 25 30 

tea tgt cca gag ggg aca gtg cca etc tac agt ggg ttt tct ttt ctt 144 
Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

ttt gta caa gga aat caa cga gee cac gga caa gac ctt gat gca ctg 192 
Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Asp Ala Leu 
50 55 60 

ttt gtg aag gtc ctg cga teg cca tagcegttea cagccaaacc actgacattc 246 
Phe Val Lys Val Leu Arg Ser Pro 
65 70 

ctccatgtcc teaeggctgg atttctctct ggaaaggatt ttcattcatc atgaaagect 306 

attccatcaa ctgtgaaagc tggggaatta gaaaaaataa taagtcgctg tcaggtgtgc 366 

atgaagaaaa gacactgaag ctaaaaaaga cagcagaact gctatttttc atcctaaaga 426 

acaaagtaat gacagaacat gctgttattt aggtattttt ctttaaccaa acaatattgc 486 

tccatgacga cttagtacaa a 507 



<210> 52 
<211> 72 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence*. GPDIII-V 
<400> 52 

Gly Leu Lya Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
15 10 15 

Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala lie Pro 
20 25 30 

Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Asp Ala Leu 
50 55 60 

Phe Val Lys Val Leu Arg Ser Pro 
65 70 



<210> 53 
<211> 659 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HMBP-21 

<220> 

<221> CDS 

<222> (37) . . (627) 
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<400> 53 

gaaaacagtg cagccacctc cgagagcctg gatgtg atg gcg tea cag aag aga 54 

Met Ala Ser Gin Lys Arg 

1 5 

ccc tec cag agg cac gga tec aag tac ctg gec aca gca agt acc atg 102 

Pro Ser Gin Arg His Gly Ser Lys Tyr Leu Ala Thr Ala Ser Thr Met 
10 15 20 

gac cat gec agg cat ggc ttc etc cca agg cac aga gac acg ggc ate 150 

Asp His Ala Arg His Gly Phe Leu Pro Arg His Arg Asp Thr Gly lie 
25 30 35 

ctt gac tec ate ggg cgc ttc ttt ggc ggt gac agg ggt gcg cca aag 198 

Leu Asp Ser lie Gly Arg Phe Phe Gly Gly Asp Arg Gly Ala Pro Lys 
4 0 45 50 

egg ggc tct ggc aag gta ccc tgg eta aag ccg ggc egg age cct ctg 246 

Arg Gly Ser Gly Lys Val Pro Trp Leu Lys Pro Gly Arg Ser Pro Leu 

55 60 65 70 

ccc tct cat gee cgc age cag cct ggg ctg tgc aac atg tac aag gac 294 

Pro Ser His Ala Arg Ser Gin Pro Gly Leu Cys Asn Met Tyr Lys Asp 

75 80 85 

tea cac cac ccg gca aga act get cac tat ggc tec ctg ccc cag aag 342 

Ser His His Pro Ala Arg Thr Ala His Tyr Gly ser Leu Pro Gin Lys 
90 95 100 

tea cac ggc egg acc caa gat gaa aac ccc gta gtc cac ttc ttc aag 390 

Ser His Gly Arg Thr Gin Asp Glu Asn Pro Val Val His Phe Phe Lys 
105 110 115 

aac att gtg acg cct cgc aca cca ccc ccg teg cag gga aag ggg aga 438 

Asn lie Val Thr Pro Arg Thr Pro Pro Pro Ser Gin Gly Lys Gly Arg 
120 125 13 0 

gga ctg tec ctg age aga ttt age tgg ggg gec gaa ggc cag aga cca 486 

Gly Leu Ser Leu Ser Arg Phe Ser Trp Gly Ala Glu Gly Gin Arg Pro 

135 140 145 150 

gga ttt ggc tac gga ggc aga gcg tec gac tat aaa teg get cac aag 534 

Gly Phe Gly Tyr Gly Gly Arg Ala Ser Asp Tyr Lys Ser Ala His Lys 

155 160 * 165 

gga ttc aag gga gtc gat gee cag ggc acg ctt tec aaa att ttt aag 582 

Gly Phe Lys Gly Val Asp Ala Gin Gly Thr Leu Ser Lys He Phe Lys 
170 175 180 

ctg gga gga aga gat agt cgc tct gga tea ccc atg get aga cgc 627 

Leu Gly Gly Arg Asp Ser Arg Ser Gly Ser Pro Met Ala Arg Arg 
185 190 195 

tgaaaaccca cctggttccg gaatcctgtc ct 659 



<210> 54 

<211> 197 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial 

iiClai Sequence: HMBP-21 

<400> 54 

Met Ala ser Gin Lys Arg Pro Ser Gin Ara His m « 

1 5 A [* Hls 01 V Ser Lys Tyr Leu 

Ala Thr Ala Ser Thr Met Asd Hi* m 9 * 

2o Asp hxs Ala Arg His Gly Phe Leu Pro Arg 

His Arg Asp Thr Gly U e Leu Asd S*t- ti 

35 6U Ser Gly Arg Phe Phe Gly Gly 

45 



Asp Arg Gly Ala Pro Lys Arg Glv Spr m • 

50 7 sf Gly Ser Q ly L/B Val Pro Trp Leu Lys 

60 1 

« ^ ^70 P - S " - - -| - Pro Gly Leu 

CVS - «e t ry* Lys as P Sftr His Hie ^ ^ ^ ^ ^ ^ J° 

Gly Ser Leu Pro Gin Lys 3er His Gly Arc Thr- r, . 

100 Thr Gin Asp Qlu Asn Pro 

110 

Val Val His Phe Phe Lys Asn n» -u 

115 78 ASn J£ Val T ^r Pro Arg Thr Pro Pro Pro 

125 

*' S Gly - "* - S - - 3« JJ s . r Irp aly 

«j ox. = ly «. Arg ,„ aiy Phe 01y ^ ^ m> up 

r 155 160 

Lys ser Ala His Lys Gly Phe L ys Gly Val Asn ai „, 

165 7 vai As P Ala Gin Gly Thr 

175 

Leu Ser Lys lie Phe Lys Leu Gly Gly Ara Asn c 

180 Y 9 P Ser Ara Ser °ly Ser 



190 

Pro Met Ala Arg Arg 
195 
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